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Preface 


This book is designed as an introduction to “abstract” algebra, particularly 
for students who have already seen a little calculus, as well as vectors and 
matrices in 2 or 3 dimensions. The emphasis is not placed on abstraction 
for its own sake, or on the axiomatic method. Rather, the intention is to 
present algebra as the main tool underlying discrete mathematics and the 
digital world, much as calculus was accepted as the main tool for continuous 
mathematics and the analog world. 

Traditionally, treatments of algebra at this level have faced a dilemma: 
groups first or rings first? Presenting rings first immediately offers familiar 
concepts such as polynomials, and builds on intuition gained from working 
with the integers. On the other hand, the axioms for groups are less complex 
than the axioms for rings. Moreover, group techniques, such as quotients 
by normal subgroups, underlie ring techniques such as quotients by ideals. 
The dilemma is resolved by emphasizing semigroups and monoids along with 
groups. Semigroups and monoids are steps up to groups, while rings have 
both a group structure and a semigroup or monoid structure. 

The first three chapters work at the concrete level: numbers, functions, 
and equivalence. Semigroups of functions and groups of permutations appear 
early. Functional composition, cycle notation for permutations, and matrix 
notation for linear functions provide techniques for practical computation, 
avoiding less direct methods such as generators and relations or table look- 
up. Equivalence relations are used to introduce rational numbers and modular 
arithmetic. They also enable the First Isomorphism Theorem to be presented 
at the set level, without the requirement for any group structure. If time is 
short (say just one quarter), the first three chapters alone may be used as a 
quick introduction to algebra, sufficient to exhibit irrational numbers or to 
gain a taste of cryptography. 

Abstract groups and monoids are presented in the fourth chapter. The 
examples include orthogonal groups and stochastic matrices, while concepts 
such as Lagrange’s Theorem and groups of units of monoids are covered. The 
fifth chapter then deals with homomorphisms, leading to Cayley’s Theorem 
reducing abstract groups to concrete groups of permutations. Rings form 
the topic of the sixth chapter, while integral domains and fields follow in the 
seventh. The first six or seven chapters provide basic coverage of abstract 
algebra, suitable for a one-semester or two-quarter course. 

Subsequent chapters deal with slightly more advanced topics, suitable for 
a second semester or third quarter. Chapter 8 delves deeper into the theory 


1x 


xX 


of rings and fields, while modules — particularly vector spaces and abelian 
groups — form the subject of Chapter 9. Chapter 10 is devoted to group 
theory, and Chapter 11 gives an introduction to quasigroups. 

The final four chapters are essentially independent of each other, so that 
instructors have the freedom to choose which topics they wish to emphasize. 
In particular, the treatment of fields in Chapter 8 does not make use of any of 
the concepts of linear algebra, such as vector space, basis, or dimension, which 
are covered in Chapter 9. For a one-semester introduction to groups, one could 
replace Chapter 6 with Chapter 10, using the field of integers modulo a prime 
in the examples that call for a finite field. 

Each chapter includes a range of exercises, of varying difficulty. Chapter 
notes point out variations in notation and approach, or list the names of 
mathematicians that are used in the terminology. No biographical sketches are 
given, since libraries and the Internet can offer much more detail as required. 

A special feature of the book is the inclusion of the “Study Projects” at the 
end of each chapter. The use of these projects is at the instructor’s discretion. 
Some of them may be incorporated into the main presentation, offering typical 
applications or extensions of the algebraic topics. Some are coherent series 
of exercises, that could be assigned along with the other problems, or used 
for extra credit. Some projects are suitable for group study by students, 
occasionally involving some outside research. 

I have benefited from many discussions with my students and colleagues 
about algebra, its presentation and application. Specific acknowledgments are 
due to Mark Ciecior, Dan Nguyen, Jessica Schuring, Dr. Sungyell Song, Shibi 
Vasudevan, and anonymous referees for helpful comments on a preliminary 
version of the book. The original impetus for the project came from Bob 
Stern at Taylor & Francis. I am grateful to him, and the publishing staff, for 
bringing it to fruition. 


Chapter 1 


NUMBERS 


Algebra begins as the art of working with numbers. The integers are the 
whole numbers, positive, negative, and zero. Put together, they form the set 

Z = {...,—2,—1,0,1,2,3,...} (1.1) 
(the letter Z coming from the German word 3abhlen, meaning “numbers” ). 
The natural numbers are the nonnegative integers, including zero. They are 
“natural” because they are the possible numbers of elements in a finite set. 
For example, 4 is the number of elements of the set 


{@,9, >, do} (1.2) 
of suits in a deck of cards, while 13 is the number of elements of the set 
{AY, KY, QY, JY, 109, 99, 89, 79, 69,59, 49, 39, 29 } (1.3) 


of cards in the suit Y of hearts. Note that 0 is the number of elements in the 
empty set @ or { }. The natural numbers form the set 


N = {0,1,2,3,...}. (1.4) 


Another set of numbers familiar from calculus is the set R of real numbers, 
like —17, /2 = 1.41421..., e = 2.71828..., 7 =3.14159..., and so on. It is 
hard to display the set of real numbers as a list of elements between braces, 
like the sets (1.1)—(1.4) above. Instead, the set R is pictured as the real line 


| | | | | | | | | 
cae 
I I I I I I I I I 


4 3 2 1 0 1 2 3 4 


(like an axis in the graph of a function). Pictures like this are useful as 
geometric visualizations of real numbers. At times similar pictures can even 
be useful for natural numbers or integers, since these numbers also happen to 
be real numbers. 


1.1 Ordering numbers 


In calculus, order relations between real numbers are crucial, for instance 
when we want to find the maximum value of a function over a certain range. 
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Recall that x < y (read “x less than y”) means y — z is positive, while x < y 
(read “a less than or equal to y”) means that y — x is nonnegative. We can 
also write y > x (“ax greater than y”) instead of « < y, or y > x (“ax greater 
than or equal to y”) instead of « < y. In the real line picture, with the positive 
numbers going off to the right, the relation « < y becomes an arrow x —> y. 
It is often helpful to signify the relation « < y with an arrow from 2 to y, 
without requiring the arrow to go horizontally from left to right. 

Since algebra also needs to work with order relations between numbers, it 
is important to know the rules for manipulating them. The first rule is called 
reflexivity: 

Le (1.5) 


for any real (or integral, or natural) number x. This particular rule doesn’t 
seem to be saying very much, but it often serves as a place-holder. The second 
rule is transitivity: 


(asy and yee) implies x<z (1.6) 


for any real (or integral, or natural) numbers «, y, and z. If Xavier can’t beat 
Yerkes, and Yerkes can’t beat Zandor, then Xavier can’t beat Zandor either. 
Why does (1.6) hold? Well, if « < y and y < z, the quantities y—a and z—y 
are nonnegative. In that case, so is their sum z — x, meaning that x < z. 
Transitivity makes a natural arrow picture: 


y y 
ca \ implies y XS 
x z ev ——> z 


... “completing the triangle.” The final rule for the order relation is the one 
that yields conclusions of proofs, when you want to show that two numbers 
are actually equal: 


(c<y and y<ax) implies r=y (1.7) 
for real numbers x and y. This rule is called antisymmetry. If Xavier can’t 


beat Yerkes, and Yerkes can’t beat Xavier either, then Xavier and Yerkes will 
tie. 


Rules for an order relation 


(R) Reflexivity: ua 
(T) Transitivity: x<y and y<z imply w<z 


(A) Antisymmetry: «<y and y<a imply w=y 
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As an illustration of the use of the rules, here’s a proposition with its proof. 


PROPOSITION 1.1 (Squeezing. ) 
Suppose x, y, and z are real numbers. Ifa <y<z< a, thenax =z. 


PROOF § Since x < y < z, transitivity shows that « < z. But also z < a, 
so antisymmetry gives x = z. 


1.2 The Well-Ordering Principle 


Compare (1.1) with (1.4). The elements of Z in (1.1) stretch off arbitrarily 
far to the left inside the braces: There is no smallest integer. In a version 
of the schoolyard game “My Dad earns more than your Dad,” consider two 
players trying to name the smaller integer. Whatever number the first player 
names, say —10,000,000, the second player can always choose —10, 000,001 
or something even more negative. With the natural numbers, the situation is 
different. It is summarized by the following statement, the so-called 


Well-Ordering Principle: 
Each nonempty subset S of N has a least element inf S. 


(Compare Exercise 7. The mathematical notation inf S stands for the infimum 
of S.) Of course, the principle is only required for infinite subsets S. For finite 
nonempty subsets 5S, the least element inf S', in this case often denoted as the 
minimum min S, can be located easily (Project 2). 


Example 1.2 (An application of the Well-Ordering Principle.) 

Suppose S = {n € N | 10” < 4n”}, the set of natural numbers n for which 
the power 10” is less than half the power n”. The set S is nonempty, indeed 
infinite, since as n increases beyond 10, the power n” grows faster than 10”. 
(Formally, limy_...($n” /10") = co.) The Well-Ordering Principle guarantees 


that S has a least element inf $. You are invited to find it in Exercise 5. [J 


In one of its main applications, the Well-Ordering Principle underwrites 
the techniques known as recursion and mathematical induction. For example, 
consider the definition of the factorial n! of a natural number n. This quantity 
is usually defined recursively as follows: 


ol=1, (n+ 1)!=(n+1)-n! 
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How can we be sure that the definition is complete, that it will not leave a 
quantity such as 50001200! undefined? 

For generality, consider a property P(n) of a natural number n, say the 
property that n! is defined by the given recursive procedure. 


e The Induction Basis is the statement that the property P(0) holds. 


e The Induction Step is the statement that truth of the property P(n) 
implies the truth of the property P(n+ 1). 


e The Principle of Induction states: The Induction Basis and Induction 
Step together guarantee that P(n) holds for all natural numbers n. 


To justify the Principle of Induction, suppose that it goes wrong. In other 
words, the set 


S ={n| P(n) is false } 


is nonempty. By the Well-Ordering Principle, the set S has a least element 
s. The Induction Basis shows that s cannot be 0. Thus s > 0, and s—lisa 
natural number. Since s — 1 does not lie in S, the property P(s — 1) holds. 
The Induction Step then gives the contradiction that P(s) is true. Thus the 
Principle of Induction cannot go wrong. 


Example 1.3 (A model proof by induction.) 
Let P(n) be the statement that the identity 


n(n + 1)(2n + 1) 
6 


17+ 27437 +...4¢(n-1? +n? = (1.8) 
holds for a natural number n. As Induction Basis, note that (1.8) reduces to 
the triviality 0 = 0 for n = 0, so P(0) is true. For the Induction Step, suppose 
that P(n) is true, so that (1.8) holds as written. Then 


1? + 2? +37 + (n= 1)? +n? + (n +17 
7 n(n + Hen +1) + (n+) 
_ n(n+1)(2n + 1) + 6(n + 1)? 
a 6 
_ (n+ 1)(2n? + 7n +6) 
-_ 6 
_ (w+ 1)(n + 2)(2(n +1) +1) 
= : 


so that P(n +1) is true. This proves (1.8) by induction. (] 
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1.3 Divisibility 

The set Z of integers is a subset of the set R of real numbers; so integers can 
certainly be compared using the order relation < for real numbers. However, 
in many cases a different relation between integers is more relevant. This is 
the relation of divisibility. Given two integers m and n, the integer m is said 
to be a multiple of n if there is an integer r such that m= r-n. For example, 
946 is a multiple of 11, since 946 = 86-11. Even integers are the multiples 
of 2. Zero is a multiple of every integer. Turning the relationship around, an 
integer n is said to divide an integer m, or to be a divisor of m, if m is a 
multiple of n. Summarizing, 


n divides m is equivalent to m is a multiple of n |. (1.9) 


The statement “n divides m” is written symbolically as n | m. 

It is useful to compare the two equivalent concepts of (1.9). Divisibility is 
most convenient for formulating mathematical claims. On the other hand, it 
is generally easier to prove those claims by working with the corresponding 
equation m = r-n from the relation of being a multiple. As an example, 
consider the proof that the divisibility relation | on Z shares the reflexivity 
(R) and transitivity (T) properties of the relation < on R (page 2). 


PROPOSITION 1.4 (Divisibility on Z is reflexive and transitive.) 
Let m, n, and p be integers. Then: 

(R) m|m; 

(T) (m|n and n|p ) implies m | p. 

PROOF (R) For each integer m, the equation m = 1-m holds, so m is a 
multiple of m. 


(T) Since m | n, there is an integer r with n = rm. Since n | p, there is an 
integer s with p = sn. Then 


p=sn= s(rm) =(sr)m 
is a multiple of m, so m | p. l] 
However, the relation | on Z is not antisymmetric. For example, 5 | —5 since 
—5 = (-1)-5, and —5 | 5 since 5 = (—1)- (—5). Nevertheless, 5 4 —5. The 


situation changes when we restrict ourselves to natural numbers. We regain 
all three properties: reflexivity (R), transitivity (T), and antisymmetry (A). 
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PROPOSITION 1.5 (Divisibility on N is an order relation.) 


Let m, n, and p be natural numbers. Then: 
(R) m | m; 
(T) (m|n and n|p ) implies m | p; 


(A) (m|n and n|m ) implies m =n. 


The proof of Proposition 1.5 is assigned as Exercise 14. The proposition 
means that divisibility relations between natural numbers may be displayed 
with arrow diagrams, just like the order relations between real numbers. For 
example, the set 

{1,2,3,4,6, 12} 
of divisors of 12 is exhibited in Figure 1.1. The diagram explicitly displays 
divisibilities such as 3 | 6 with arrows: 3 —> 6. Other relations, such as 
3 | 12 or 4| 4, are implicit from the transitivity and reflexivity guaranteed by 
Proposition 1.5. 


~ 12 


~ 4 


FIGURE 1.1: The positive divisors of 12. 


1.4 The Division Algorithm 


To check whether a positive integer d divides a given integer a (positive, 
negative, or zero), a formal procedure known as the Division Algorithm is 
available. Given the 


input: a positive integer d (the divisor) and (1.10) 
an integer a (the dividend), (1.11) 
the Division Algorithm (Figure 1.2) produces the 


output: an integer gq (the quotient) and (1.12) 
an integer r (the remainder), (1.13) 
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satisfying the following: 


a=dq+tr; (1.14) 
O<r<d. (1.15) 


For example, given the divisor 5 and dividend 37, the algorithm produces 7 
as the quotient and 2 as the remainder: 37 = 5-7+2, with 0 < 2 < 5. Given 
divisor 5 and dividend —42, it produces —42 = 5- (—9) +3, withO <3 <5. 
In general, the dividend a is a multiple of the divisor d if and only if the 
remainder r is zero. 


dividend a quotient q 


divisor d remainder r 


FIGURE 1.2: The Division Algorithm. 


The word dividend in (1.11) means “the thing that is to be divided,” like 
the profits of a company being divided among the shareholders. The word 
quotient in (1.12) is Latin for “How many times?” (the divisor d has to be 
added to itself to approach or equal the dividend). Then the remainder r is 
what is left after subtracting q times the divisor d from the dividend a. 

The following proposition, with its proof, is a guarantee that the Division 
Algorithm will always perform as claimed. The proof relies on the use of the 
Well-Ordering Principle as presented in Section 1.2. 


PROPOSITION 1.6 

Given a dividend a as in (1.11), and a divisor d as in (1.10), there is a unique 
quotient q as in (1.12) and a unique remainder r as in (1.13), such that the 
equation (1.14) and inequalities (1.15) hold. 


PROOF Define a subset S of N by 
S={a-—dk|keZ, a—dk>0} (1.16) 


— the set all integers of the form a — dk in which k is an element of the set 
Z of integers, and such that the inequality a — dk > 0 is satisfied. 
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Claim 1: The set S is nonempty. 


If a > 0, then a— d-0 =a is an element of S. Now d is a positive integer, 
sod—1>0. Then if a < 0, we have a — da = (—a)(d— 1) > 0, as a product 
of two nonnegative integers. Thus a — da is an element of S in this case. 


With Claim 1 established, we can appeal to the Well-Ordering Principle. 
It tells us that the nonempty subset S of N has a least element inf S. Set 


r=infS. (1.17) 


Since r is an element of S, we have 0 < r, the left-hand inequality in (1.15). 
And again since r is an element of S, we know that it is of the form r = a—dk 
for some integer k. Set the quotient q to be the integer with 


r=a-—dq. (1.18) 
Adding dq to both sides of this equation yields (1.14). 
Claim 2: r < d. 


Could Claim 2 possibly be false? Could it happen that r > d? Well, if so, 
r — dis still a natural number. But by (1.18), 


r—-d=a-dq+l), 


so r — d would be a member of S strictly less than r. That would contradict 
(1.17), so the assumption that led to the contradiction, namely r > d, must 
be false. This shows that Claim 2 must be true, and verifies the right-hand 
inequality in (1.15). 


Claim 3: The integers g and r satisfying (1.14) and (1.15) are unique. 
Suppose a = dq’ +r’ for integers q’ and r’ with 0 < r’ < d. Nowr’ <r 
cannot be true, for otherwise we would have 0 < r’ = a — dq’ as an element 


of S less than r, the least element of S. Conversely, r < r’ cannot be true 
either, for then we would have q > q’, i.e., (¢—q’) > 0 and (q—q’) > 1, with 


Wart (r'—r)=rt ((a—dq’) — (a-dg)) =rt+dq-¢d)>d, 


in contradiction to r’! < d. Thusr=r’' andq=q. J 
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1.5 Greatest common divisors 


Let a and b be nonzero integers. A positive integer c is said to be a common 
divisor of a and b if it divides both a and b: 


c|aandc]|b. 


For example, consider the divisors of 72 displayed in Figure 1.3. It is apparent 
that 4 is a common divisor of 24 and 36. 


9 ~ 18 ~ 36 > 72 


3 - 6 12 » 24 
2 


> 4 ~ 8 


FIGURE 1.3: The positive divisors of 72. 


There are other common divisors of 24 and 36, such as 2 and 12. 


DEFINITION 1.7 (Greatest common divisor, relatively prime.) 
Let a and b be nonzero integers. 


(a) A positive integer d is the greatest common divisor (GCD) of a and b if 


e dis a common divisor of a and b, and 


e if cis a common divisor of a and b, then c < d. 


(b) The integers a and b are said to be relatively prime or coprime if their 
greatest common divisor is 1. 


For instance, 12 is the greatest common divisor of 24 and 36. The numbers 
8 and 9 are relatively prime. Note that 1 is coprime to every nonzero integer. 
Why should the greatest common divisor of two nonzero integers a and b be 
guaranteed to exist? Well, the set of common divisors of a and 0 is a finite set 
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S, the intersection of the finite sets of positive divisors of a and b. (Compare 
Exercise 11.) The greatest common divisor is then just the maximum element 
of the finite set S. Since each pair a, b of nonzero integers has a uniquely 
defined greatest common divisor, we may use a functional notation 


gcd(a, b) 
to denote that number. For example, gcd(24, 36) = 12. Note that 
gcd(a, a) = |al, (1.19) 
gcd(b, a) = gced(a, b), (1.20) 
and 
gcd(a, b) = gcd(—a, b) = gcd(a, —b) = ged(—a, —b) (1.21) 


for nonzero integers a and b (compare Exercise 26). 
The defining properties of the greatest common divisor of a pair of nonzero 
integers a and b may be summarized as follows: 
d= gcd(a,b) if and only if: 
e dl|aandd|b; (1.22) 
e (cl|aandc|b) implies c<d. (1.23) 


1.6 The Euclidean Algorithm 


Given nonzero integers a and b, how can we compute gcd(a, b)? By (1.21), 
it is sufficient to consider the case where a and 0 are both positive. By (1.19), 
it is sufficient to consider the case where a and 6 are distinct. And finally, 
by (1.20), it is sufficient to consider the case where a > 6. Then for positive 
integers a > b, the positive integer gcd(a,b) is produced by the Euclidean 
Algorithm. 

In fact, the Euclidean Algorithm is capable of more. Borrowing terminology 
from matrix theory or linear algebra, define a real number z to be an integral 
linear combination of real numbers x and y if it can be expressed in the form 


z=la+my (1.24) 


with integer coefficients | and m. Much of the significance of integral linear 
combinations resides in the following simple result, whose proof is assigned as 
Exercise 27. 


PROPOSITION 1.8 (Common divisor divides linear combination. ) 
A common divisor c of integers n and p is a divisor of each integral linear 
combination ln + mp of n and p. 
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The Euclidean Algorithm not only produces gcd(a,b), but if required 
may also be used to exhibit gcd(a,b) as an integral linear combination of a 
and b. Given integers a > b > 0, the algorithm works with a strictly decreasing 
sequence 

ry>T > >rg>+0+ > re > Te41 = 0 (1.25) 


of natural numbers. Following the initial specification 
rj=a and r=), 


the natural numbers (1.25) are produced by a series of steps. For 0 <i <k, 
Step (2) applies the Division Algorithm with r;_, as the dividend and r; as 
the divisor: 

e-1 = Git1li +Ti41, (1.26) 


obtaining rj, as the remainder with r; > riz; > 0 (and some integer qi+1 
as the quotient). The Euclidean Algorithm makes its last call to the Division 
Algorithm in Step (k), obtaining the remainder r;,4,; = 0. At that time the 
greatest common divisor gcd(a, b) is output as rz, the last nonzero remainder 
in the list (1.25). 

Why is rx = gcd(a, b), and how is rz, produced as a linear combination of a 
and b.? To answer these questions, it is helpful to rewrite (1.26) as the matrix 


equation 
ri-1} ita Ll | ori 
Palen can 


holding for 0 <i < k. (Compare Section 2.3, page 28, for a review of matrix 
multiplication.) Note that (1.27) is an equality between 2-dimensional column 
vectors with integral entries. Equality of the bottom entries is trivial, while 
(1.26) is the equality between the top entries. Now 


0 1 G41 1} — J1 0) | Jai 1 0 1 
ie ami 0 Re ores 0 


so (1.27) is equivalent to the matrix equation 


fal ~ fi | a (1.28) 


for 0 <i<k. Repeated use of (1.28) gives 


Tk ee, 0 1 0 1 PIL) 2 ps t i a | 
Thad Ll —qrsi} Jl -q} | ro | Juvl | ro 
for integers s and t (computed by multiplying the 2 x 2 matrices in the middle 


term), so r;, is expressed as the integral linear combination 


Th = sr_y+tro = sat+tb (1.29) 
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of a and b. By Proposition 1.8, any common divisor c of a and b is a divisor of 
rz, confirming that rz satisfies the requirement (1.23) for the greatest common 
divisor of a and b. Finally, repeated use of (1.27) gives 


a} |r-i]_ |Jql qe+1 1] | re | _ |s’ Ire 

b} | ro} {1 Of] 1 Of fresi} ful of} | 0 
for integers s’, t’, u’, and v’, so that a = s’/rz and b = u'rz. This means that 
rz, | a and r, | 6. Thus rz satisfies the requirement (1.22) for the greatest 
common divisor of a and b. 


Now we know that rz, = gcd(a,b), the import of the equation (1.29) may 
be recorded for future reference as follows. (Compare Exercise 28.) 


PROPOSITION 1.9 (GCD as an integral linear combination.) 


Let a and b be nonzero integers. Then the greatest common divisor gcd(a, b) 
may be expressed as an integral linear combination of a and b. 


Example 1.10 (A run of the Euclidean Algorithm.) 


Consider the determination of gcd(7,5) with the Euclidean Algorithm. The 
calls to the Division Algorithm are as follows: 


Step (0): 7=1-5 + 2 
Step (1): 5=2-2 4+ 1 
Step (2) : 2=2-1 + 0 


Thus gcd(7,5) emerges as 1, the remainder from the penultimate Step (1). 
The matrix equations (1.27) become 


B-bobl El-bd bl bl-bd ld: 


y-fR BPG. 
fae 


whence gcd(7,5) =1 = (—2)-7+3-5. (] 


Thus 


II 

| 
aa) 
| ww 
ne 
— 
oN 
Pees 
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1.7 Primes and irreducibles 


The positive number 35 can be reduced to a product 5-7 of smaller positive 
numbers 5 and 7. On the other hand, neither 5 nor 7 can be reduced further. 
In fact, if 5 = a-b for positive integers a and b, thena =1andb=5o0ra=5 
and b= 1. We define a positive integer p to be irreducible if p > 1 and 


O<d|p implies (d=lord=p) (1.30) 


for integers d. Irreducibility is an “internal” or “local” property of a positive 
integer p, only involving the finite set of positive divisors of p. 

Now look outwards rather than inwards. The positive number 35 may divide 
a product, without necessarily dividing any of the factors in that product. For 
example, 35 divides 7-10, but 35 does not divide 7 or 10. On the other hand, 
5 divides the product 7-10, and then 5 divides the factor 10 in the product. 
We define a positive integer p to be prime if p > 1 and 


p|a-b implies (p|a or p|b) (1.31) 
for any integers a and b. Primality may be considered as an “external” or 


“elobal” property of a positive integer p, since it involves arbitrary integers a 
and b. The two properties are summarized as follows: 


Properties of an integer p > 1: 
(internal) irreducible: 0<d|p implies (d=1lord=p) 


(external) prime: p|a-b implies (p|a or p|b) 


It is a feature of the integers that the internal concept of irreducibility 
agrees with the external concept of primality. 


PROPOSITION 1.11 (“Prime” = “irreducible” for integers.) 

Let p> 1 be an integer. 

(a) If p is prime, then it is irreducible. 

(b) If p is irreducible, then it is prime. 

PROOF (a): Suppose p is prime and 0 < d | p, say p = d’d for some 
positive integer d’. Then p | dd. Since p is prime, it follows that p | d’ or 


p |d. In the latter case, d | p and p | d, so d = p by antisymmetry. In the 
former case, the same argument (replacing d by d’) shows d’ = p. Then d = 1. 
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(b): Suppose p is irreducible and p | a- b, say ab = pk for some integer k. 
Suppose p does not divide a. It will be shown that p | b. Since p is irreducible, 
its only positive divisors are 1 and p. Thus gcd(p,a) = 1, for ged(p,a) = p 
would mean p | a. Using Proposition 1.9, write gcd(p, a) as an integral linear 
combination 

1=lIp+ma 


of p and a. Postmultiplying by b gives 


b = lpb + mab 
= Ipb + mpk = p(lb+ mk), 


so that p | b as required. ] 


With Proposition 1.11 proved, prime numbers (as in Figure 1.4) may be 
characterized equally well by either the irreducibility (1.30) or the primality 
(1.31). (See the Notes to this section on page 23.) 


2 3 5 7 ill 138 #17 «19 23 29 
31 37) «6410 480 470 5B OD LSC 
73 79 83 89 97 101 103 107 109 113 

127 131 137 1389 149 151 157 163 167 173 
179 161 191 1938 197 199 211 223 227 229 


FIGURE 1.4: The first 50 prime numbers. 


There is a traditional adjective for numbers which are not prime: 


DEFINITION 1.12 (Composite numbers.) An integer n is said to 
be composite ifn > 1, but n is not prime. 


Thus a number n > 1 is composite if it is not irreducible, i.e., if it has a 
nontrivial factorization n = a- b with integers 1 <a<nandl<b<n. 


1.8 The Fundamental Theorem of Arithmetic 


In Figure 1.3, the number 72 is displayed as the product 72 = 8-9 = 23-3? = 
2-2-2-3-3 of prime numbers. The latter product may be written with the 
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factors in various orders, such as 72 = 2-3-2-2-3 or 72 =2-3-2-3-2. But to 
within such reorderings of the prime factors, the factorization is unique. The 
Fundamental Theorem of Arithmetic states that every integer greater than 
1 has a factorization as a product of primes, unique up to reordering of the 
factors. 

The existence part of the theorem is stated and proved as follows. 


THEOREM 1.13 (Existence of factorizations.) 
Each integer n > 1 may be expressed as a product of prime numbers. 


PROOF Let B be the set of integers n > 1 which cannot be expressed 
as a product of primes. If the theorem is false, then B is nonempty. In that 
case, the Well-Ordering Principle says that B has a least element b. Since 
the integer 6 lies in the set B, it is not itself prime (or irreducible), so it has 
divisors gi and g2 with 

b= 9gig2 (1.32) 


and 1 < gi,g2 < b. Since the divisors g; and gp are strictly less than 8, 
the least element of B, they are expressible as products of primes. But then 
(1.32) expresses the integer b as a product of primes, contradicting its status 
as a member of B. Since falsehood of the theorem leads to an inevitable 
contradiction, we conclude that the theorem is true. l] 


Implicit in the proof of Theorem 1.13 is a method, however slow, to produce 
the factorization of a given integer larger than 1 as a product of primes. For 
example, consider b = 500, which factorizes as b = giga with gi = 50 and 
go = 10. Then g3 = 5-10 =5-2-5 and gg =2-5, so 500 = 5-2-5-2-5. If 
b is less friendly, e.g., b = 281957, then one has to try dividing b in turn by 
successive primes p = 2,3,5,7,11,... up to Vb (compare Exercise 36). 

We now state the uniqueness half of the fundamental theorem. 


THEOREM 1.14 (Uniqueness of factorization.) 
Suppose that p,,p2,.-.,Pr and qi,qQ2---,Qs are primes. Then if 


oe ee ee a (1.33) 


r= s, and each p, on the left hand side of (1.33) appears as a q,; on the right 
hand side of (1.33). 


To prove Theorem 1.14, we will use a subsidiary result, a “lemma.” 


LEMMA 1.15 
Suppose that p1,q1,42,---;4s are primes. Then if 


Di | Gi Go. 6Wes (1.34) 
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there is some 1 <j <8 such that p; = qj. 


PROOF Suppose that the lemma is false. Let S be the set of natural 
numbers s for which there are primes p1,q1,q2,---,@s with (1.34) holding, 
but where p; does not appear as any q; with 1 < 7 < s. Since the lemma 
is false, the set S is nonempty, and thus has a least element s. Consider 
D1, 41; 92;-+-;Gs as in (1.34) for this integer s. Now p; does not divide the 
product qi-q2:---'ds—1, for then the minimality of s in S would mean that p; 
shows up among q1, @2,---,Qs—1- Since p; is prime, and (1.34) holds, it follows 
that p, | qs. Since 1 < p; and q, is irreducible, p; = q,, in contradiction to 
the assumption. Thus the lemma is true after all. (] 


To complete the proof of Theorem 1.14, suppose (1.33) holds. Then 


fy | Gia e924 Aas 
By Lemma 1.15, there is some 1 < j < s such that p; = q;. Then 


P2°P3 + +++ * Pr = 1° +--+ Gj-1° Gj41 +++ Ws - 
By Lemma 1.15, p2 cancels with some q, from the right-hand side. Continuing 
in this fashion, the p; on the left of (1.33) are paired off with the qg; on the 
right. In particular, the number r of factors on the left-hand side of (1.33) 
agrees with the number s of factors on the right. 
The Fundamental Theorem of Arithmetic makes a connection between the 
two order relations < and | on the set N of natural numbers. Specifically, for 


distinct primes p), p2,..., pr and natural numbers ej, fi, €2, f2,..-, er, fr; 
py py... per | pi" pe... pf if and only if ey < fi, ..., er < fr. 


We conclude with an application of this idea. 


DEFINITION 1.16 (Least common multiple.) Let a and b be 
nonzero integers. The least common multiple lcem(a,b) of a and b is the 
minimum element of the set S = {m|m->0, a|m, b| m} of positive 
common multiples of a and b. 


Write max{e, f} for the maximum of integers e and f. The Fundamental 
Theorem of Arithmetic yields the following result. Its proof is assigned as 
Exercise 39. 


PROPOSITION 1.17 (Computing the least common multiple.) 


Let a= py - ps? +... per and b= pt . pi? -...: pl” for distinct primes p1, po, 


.., Pr, and natural numbers e1, fi, €2, fa;.--, er, fr. Then 


Iem(a, b) = pes - pmextea,fa} ‘ 


max{e,,f;} 


Leet Dp 


1.9 


1. 


N DD om Fe WwW 


10. 
11. 


12. 


NUMBERS 17 


Exercises 


Suppose x, y, and z are real numbers. If « < y < z < a, give a formal 
proof that y = z by use of transitivity and antisymmetry. 


. Suppose that 79, 21,...,@n are real numbers, with xp < 471 <--- < ap. 


If v7, < 20, show that ro = xz, forl <r<n. 


. Why is (1.7) true? 

. Why is (1.5) true? 

. Find the least element inf S of the set S from Example 1.2. 
. Find the smallest integer n for which 2” < nl. 


. Let S be a nonempty subset of N. Let s be an element of S. The 


intersection {0,1,...,5—1}MS denotes the set of elements of S less 
than s. 
(a) If the intersection {0,1,...,s—1}NS is empty, show that inf S = s. 


(b) If the intersection {0,1,...,s — 1}MS is nonempty, show that 
inf S = min({0,1,...,s-—1} 1S). 


. Prove . 
13 | 93 | 33 | | n? = n(n As 1) 
T T T T 9 
for natural numbers n. 
(a) Prove 
1 
14243+..¢n= Mth (1.35) 


for natural numbers n by induction. 


(b) Can you prove (1.35) directly, without using induction? 
Prove n < 2” for natural numbers n. 


Let m be a nonzero integer. 
(a) Show that n | m implies |n| < |m]|. (In words: each divisor of a 
nonzero integer is no greater than that integer in absolute value.) 


(b) If you are uncomfortable with absolute values, show instead that 
n|m implies n? < m?. 


(c) Conclude that the set of divisors of m is finite. 


Show that every integer divides zero. 
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. There are 36 inches in a yard, and 100 centimeters in a meter. 


(a) In how many ways can a piece of wood a yard long be divided into 
equal pieces whose length is an integral number of inches? 

(b) In how many ways can a piece of wood a meter long be divided into 
equal pieces whose length is an integral number of centimeters? 


Prove Proposition 1.5. [Hint: To prove the antisymmetry (A) that does 
not hold for divisibility on Z, consider the solutions x of the equation 
x? =1in Zand N_] 


Describe the divisibility relation | on the set R of real numbers. 
Consider running the Division Algorithm on the inputs a = 1 and d = 0. 


(a) For the set S' of (1.16), what is inf S? 
(b) Show that a unique remainder r is obtained, but that the quotient 
q is not unique. 


(c) Is Proposition 1.6 contradicted? 


Let d be a positive odd number. Show that for each integer a, there are 
unique integers g and r such that a = dq+r with |r| < d/2. In other 
words, each integer a can be approximated by a multiple of d to within 
an error of less than d/2. 


Consider the 16 x 11 rectangular array of 176 pixels in a display. 
10 
9 
8 
if 
6 
5 L_| 
4 
3 
2 
1 
0 


0123 45 6 7 8 9101112131415 


The pixels are located by their coordinates in the array, so that the 
bottom left pixel has coordinates (0,0), and the top right pixel has 
coordinates (15,10). The pixels are addressed by the numbers from 0 
to 175. The address of the pixel with coordinates (q,7r) is 


a=llq+r. 


19. 


20. 


21. 


22. 


23. 
24. 


25. 


26. 
27. 
28. 
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(a) What is the address of the pixel with the black square? 
(b) What are the coordinates of the pixel with address 106? 
Let d > 1 be a fixed integer, known as the base. To represent a given 
positive integer n as a sequence n = ngng_1...N2N1 Of digits in base d, 
with 0 <n; < d for 1 <i<k, consider the following algorithm: 
(a 
(b 
(c 
(d 


Initialize with qo=nandi=1; 

At Step (i), obtain q—1 = qd+n; with the Division Algorithm; 
Stop at Step (k) when q, = 0 ; 

Otherwise, replace 7 by 7+ 1 and return to (b). 


Show that n = ngd*-1 + ny_yd*®-? +... + nod + ny. 


Express the base 10 number 3817 as a hexadecimal (base 16) number. 
Use A= 10, B=11, C= 12, D= 13, E = 14, F = 15 for the digits 
above 9. 


In a certain state, persons under age 21 are not allowed into bars that 
serve intoxicating beverages. If 21 were read as an octal number (to base 
8), what would be the minimum age (to the usual base 10) of persons 
allowed into bars? 


In Figure 1.3: 


(a) Identify the set of positive divisors of 18. 
(b) Identify the set of positive divisors of 24. 
(c) Identify the set S of common divisors of 18 and 24. 


(d) Identify gcd(18, 24) as the largest element of the set S. 
Find all pairs of relatively prime positive integers less than 10. 


Show that 1 is the only positive integer that is relatively prime to every 
positive integer. 


In a gearbox, gear wheel A meshes with gear wheel B. The two rotate 
together many times. Gear wheel A has a teeth, and gear wheel B has b 
teeth. Show that each tooth of A meshes with each tooth of B at some 
time if and only if a and 0 are relatively prime. 


Prove the equalities (1.19), (1.20), and (1.21). 
Prove Proposition 1.8. 


Without appealing to the discussion of the Euclidean Algorithm, give 
a direct proof of Proposition 1.9. [Hint: Applying the Well-Ordering 
Principle, show that gcd(a,b) is the smallest member of the set S of 
positive, integral linear combinations of a and b.] 
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34. 


35. 


36. 
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Let c be a positive common divisor of two nonzero integers a and b. 
(a) Show that c divides gcd(a, 6). 
(b) Show that ged(a, b)/c = gcd(a/c, b/c). 


For nonzero integers a, b, and c, with c > 0, show that gcd(ac, bc) = 
gcd(a, b) +c. 


Let a and b be distinct nonzero integers. Show that the greatest common 
divisor gcd(a, b) can be expressed in infinitely many distinct ways as an 
integral linear combination gcd(a, b) = la + mb of a and b. 


Use the Euclidean Algorithm to determine gcd(109, 60), and to express 
it as an integral linear combination of 109 and 60. 


Show that 2n + 1 and 3n +1 are coprime for all natural numbers n. 


Show that the Euclidean Algorithm will make at most b calls to the 
Division Algorithm when it computes gcd(a,b) with a > b > 0. 


(a) In how many ways can 72 be expressed as an ordered product of 
three twos and two threes? 


(b) Interpret each such expression 72 = pyp2p3paps (with p; = 2 or 3) 
as a walk from 1 to 72 along the path 


1— pi > pip2 > pip2p3 — pip2p3P4 — P1p2p3paps = 72 


in Figure 1.3. 


(c) Conversely, show that each path from 1 to 72, following the arrows 
at each step, determines an ordered factorization. 


(a) Show that a composite number b has a prime divisor p with p < Vb. 


(b) Conclude that an integer n is prime if it is not divisible by any 
prime less than \/n. 


Factorize b = 281957 as a product of primes. 
Can you prove that n? —n-+ 41 is prime for each natural number n? 
Prove Proposition 1.17. 


For positive integers a and 6, show that an integer is a multiple of both 
a and 6 if and only if it is a multiple of lcm(a, b). 


Use the Fundamental Theorem of Arithmetic to obtain a formula for 
gcd(a,d), similar to the formula for lem(a, b) given in Proposition 1.17. 


For nonzero integers a, b, and c, show that gcd(a, bc) = 1 if and only if 
both ged(a, b) = 1 and gcd(a,c) = 1. 
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43. For positive integers a and b, prove a- b = gcd(a,b) - lem(a, 6) . [Hint: 
For natural numbers e and f, prove e+ f = min{e, f} + max{e, f}.] 


44. (a) Give an example of prime numbers py, p2 and natural numbers e1, 
fi, €2, fo such that 


lem (p§" p52, p!*p??) # pe ps ent ; 


(b) Why does this not contradict Proposition 1.17? 


45. (a) Let py = 2, po = 3,p3 = 5,...,p, be the first r primes. Show that 
N= pi-po-..-+Pr+1 


is not divisible by any of py, p2,...,Dr- 


(b) Applying Theorem 1.13 to n, deduce that there is a prime number 
Ds With pp < ps <n. 


(c) Conclude that there is an infinite number of primes. 
46. Let n be a positive integer. A positive integer d is said to be a unitary 
divisor of n if d divides n, and gced(d,n/d) = 1. In this case, n is said 
to be a unitary multiple of d. 
(a) Determine the unitary divisors of 72 and 1200. 
(b) Determine the least common unitary multiple of 18 and 45. 


(c) Show that there is no least common unitary multiple of 3 and 9. 
47. Consider a world in which the only positive numbers are the numbers 
1,5,9, 13,17, 21, 25, 29,... (1.36) 


of the form 4r+ 1 for r in N. Suppose that the numbers are only 
multiplied, not added. 


(a) Show that the product of two numbers from the list (1.36) also 
appears in the list. 


(b) Show that the numbers below 25 in the list (1.36) are irreducible 
within this alternative world. 


(c) Show that 9 divides 21-21, but 9 does not divide 21. 


(d) Conclude that in this world, the property of being prime is distinct 
from the property of being irreducible. 
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1.10 Study projects 


1. For a sport competition of your choice (say one season of a particular 
league), determine whether the transitivity rule (1.6) and antisymmetry 
rule (1.7) apply. 


2. Consider the problem of finding the minimum min S' of a finite set S' of 
natural numbers with n elements. Design a procedure to do this with 
just n—1 comparisons between pairs of elements from S$. As inspiration, 
look at the brackets for a single-elimination sport competition in a league 
with m members. (Compare Figure 1.5 for the case n = 6.) 


Avalon 
Avalon 
Bangor 
Dunbar 
Caspar 
Dunbar 
Dunbar 
Easton 
Easton 
Easton 
bye 
Easton 
Forfar 
Forfar 
bye 


FIGURE 1.5: Brackets for a competition. 


3. The number 946 is a multiple of 11. Also, the difference between the 
respective sums 9+ 6 and 4 of the odd-placed and even-placed digits of 
946 is (a multiple of) 11. Is this just a coincidence, or can you extend 
the observation to derive a quick way of recognizing multiples of 11? 


NUMBERS 23 


4. Discuss why Proposition 1.6 is needed. Why is it not enough to claim 
that your computer or calculator can produce a unique quotient q and 
remainder r if you give it a dividend a and (positive) divisor d? [Hint: 
Can your computer accept a very large integer? ] 


5. Speed of the Euclidean Algorithm. Exercise 34 gives a crude bound 
for the number of steps required by the Euclidean Algorithm. Can you 
improve on this bound? Or for any positive integer k, can you always 
find integers a > b > & for which the Euclidean Algorithm requires 
approximately b steps? 


6. Greatest common divisors. Consider the following method to find 
the greatest common divisor of positive integers a and b: 


(a) If a and b are even, remember gcd(a,b) = 2- gcd(a/2,b/2) and 
compute gcd(a/2,b/2) instead. (Compare Exercise 29.) 

(b) If say a is even and 0 is odd, remember gcd(a, b) = gcd(a/2, 6) and 
compute gcd(a/2, b) instead. 

(c) If a and b are odd, say a > b, remember gcd(a,b) = ged(a — 6, b) 
and compute gcd(a — b, b) instead. 


Use this method to compute greatest common divisors of pairs of large 
integers. How does this method compare with the Euclidean Algorithm? 
Can you adapt this new method to express gcd(a, b) as a linear combi- 
nation of a and 6? 


1.11 Notes 
Section 1.6 


Euclid (EvkAevdns) was a Greek mathematician living in the third century 
B.C. 


Section 1.7 


When discussing integers, it has been traditional to define a number p to 
be “prime” if it is irreducible. The proof of primality as we have defined it 
— Proposition 1.11(b) — is then known as Euclid’s Lemma. Historically, 
the distinct terminology for the internal and external properties emerged in 
the 19th century, as mathematicians started to consider other systems of 
factorization (for example the system of Exercise 47). In these cases, the two 
properties may no longer coincide. 
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Chapter 2 


FUNCTIONS 


Algebra, just like calculus, works with many different kinds of functions. In 
this chapter, we will learn how to specify functions, and how to compose them. 
We will also see how functions form mathematical structures: semigroups, 
monoids, and groups. 


2.1 Specifying functions 


Let X and Y be sets. Then a map or function f : X — Y or X S.Yisa 
rule that assigns a unique element f(x) of Y to each element x of X. In this 
context, the elements x of X are called the arguments of the function f, while 
the elements f(x) of Y are called the values of the function. As examples, 
consider the squaring function 


sq:Z—N (2.1) 
defined by sq(n) = n? for each integer n, or the absolute value function 
abs: Z +N (2.2) 


defined by abs(n) = |n| for each integer n. In a function f : X — Y, the 
set X is called the domain, while the set Y is called the codomain. Thus the 
domain of (2.1) is Z, while the codomain of (2.1) is N. Note that (2.1) is 
considered as different from the function 


sq: Z—3Z (2.3) 


with sq(n) = n?, since the two functions have different codomains. In general, 
two functions f : X — Y and g: Z — T are equal if and only if all three of 
the following conditions are satisfied: 


e The domain X of f equals the domain Z of g ; 
e The codomain Y of f equals the codomain T of g ; 


e The function values f(x) and g(x) agree on each argument x in X. 


25 
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The reason for including the domain and codomain in the specification of a 
function will become apparent in Section 2.5. 

A function f must be able to assign a function value f(a) to each argument 
x in its domain. For instance, we cannot have a function 


inv:R—R 
with inv(2) = 2~+, since this rule does not work for the element 0 of the 
domain R. On the other hand, elements y of the codomain of a function 
f :X —Y are not required to show up as actual function values f(x). While 
each natural number does occur as the absolute value of some integer, there 
are many natural numbers (such as 3) which are not the square of any integer. 
The only demand placed on the codomain is that it be big enough to contain 
all the function values that are generated. For example, we cannot set up a 
function 

sqrt: NN 


with sqrt(n) = /n for natural numbers n, since the function value V3 does 
not lie in N. But setting 
sqrit:N—R 


would be fine, since square roots of natural numbers are always real numbers. 

In summary, the domain should always be small enough to guarantee 
that the function rule will work on each element of the domain. On the other 
hand, the codomain should always be large enough to contain all the 
function values that occur. In a function f : X — Y, the set 


F(X) ={f(@) | a € X} (2.4) 


of function values of domain elements is called the image of the function. For 
example, the image of the squaring function (2.1) is the set 


{0,1,4,9, 16,25, 36,...} 


of perfect squares. 

It is sometimes helpful to be able to specify a function without naming it 
explicitly. To this end, we will denote the action of a function at the element 
level using barred arrows, e.g., sq: nt n?. Thus the squaring function (2.1) 
might have been specified as 


Z—N;nt n? 

without having to receive the (rather artificial) name sq. The barred arrow 
notation is especially helpful when we examine functions whose arguments or 
values are themselves sets (as in Section 3.2, for example). Suppose that A 
and B are sets. Then f : A — B will denote a function with domain A and 
codomain B, while f : A t+ B means that a certain function f takes the 
argument A to the value B. 
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Warning: In calculus, the notation “f(x)” is sometimes used to denote a 
function, for example when we speak of “the function sin(x).” In algebra, the 
notation “f(a)” is reserved for the value of a function f at an argument wz. 
Do not confuse functions with elements of their domains or codomains. 


2.2 Composite functions 


Consider two functions f: X — Y and g: Y — Z, where the codomain Y 
of f is also the domain of g. Then there is a composite function 


gof:X > Z;2+ 9(f(z)) 


whose domain is the domain of f and whose codomain is the codomain of g. 
For example, the squaring function sq : Z — Z of (2.3) may be composed 
with the absolute value function abs : Z — N of (2.2) to yield the function 


absosq: Z—> Nn |n?|. 


In fact, since |n?| = n? for any natural number n, this composite function 
abs o sq is the same as the original squaring function (2.1). 

Composition of functions f: X — Y and g: Y — Z may be illustrated by 
an arrow picture, strongly reminiscent of the picture of transitivity on page 2: 


Y 
f g 
—_ 


If you find yourself getting confused by a profusion of functions, it can be 
helpful to draw such pictures. 

Suppose that there are functions f: X —~ Y,g:Y — Z,andh: Z—T. 
These functions may be composed in two different ways: 


ho(gof):X >T;x2 h(go f(z)) 


and 

(hog)of:X >T;a (hog)(f(a)). 
However, 

h(go f(x)) = h(g(f(x))) = (ho g)(F(2)) 
for all x in X, so in fact we have the associative law 


ho(gof)=(hog)of (2.5) 


for X 
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2.3. Linear functions 


Linear functions form one of the most important classes of functions. For 
positive integers m and n, consider the set 
Ais va} 


of m x n real matrices. In particular, R} is the set of 2-dimensional real 
column vectors 
x = 
ED) 


with x1, 22 in R. Each 2 x 2 real matrix 


A- Ee | 


a21 422 


gives a linear function 


Bas R} = RI; ia ng be Tr | (2.6) 


x2 a2121 + A22%2 
or 
La(x) = Ax 


using matrix multiplication. Note that 


1 — {411 0 _ {412 
taf [on] sme tli] = [ea 


so the linear function L.4 determines the matrix A. 
Given a second matrix 
ties i a 


bar boo 


with a corresponding linear function Lg : x + Bx, the matrix product BA 
is defined by 


by1 by2} }a@11 @12 = 641411 + by2@21 611412 + bi 2422 (2 7) 
bo1 b22| |a@21 a22 621411 + be2G21 b21412 + be2G22 ; 
This apparently complicated formula is designed to make the equation 
Lea(x) = Dg o La(x) (2.8) 


true for all x in R4 (Exercise 3): Matrix multiplication tracks the composition 
of the corresponding linear functions. In particular, the associativity of matrix 
multiplication is a direct consequence of the associativity (2.5) of function 
composition. 
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2.4 Semigroups of functions 
Let X be a set. A map or function f : X — X from X to itself is often 
described as a self-map of the set X. In this context, the set X is sometimes 


called the base set for the function f: X — X. 
A set S of functions 


DEFINITION 2.1 (Semigroup of functions.) 
f:X — X with domain X and codomain X is said to be a semigroup of 
functions on the base set X if 

f and g in S imply gof in S. (2.9) 


We also say that the set S is closed under composition. 


If f is an element of a semigroup S of functions, the powers f” for positive 


integers n are defined recursively by f1 = f and f"t! = fo f. 
Here are some important examples of semigroups of functions. 


Example 2.2 (Self-maps.) 
For a base set X, define X~* to be the set of all functions from X to X. Then 
X~* forms a semigroup of functions on X. (For a justification of the notation, 
see Exercise 5.) 

Example 2.3 (Constant functions.) 

Let X be a set, and let Y be a subset of X. For each element y of Y, define 

a constant function 

ty: X aX rrey. 


Note that for each element x of X, and for y, z in the subset Y, we have 


Cz 0 Cy(a) = Cz(Cy(@)) = cz(y) = 2 = c2 (2), 
so that c, ocy = c,. Thus the set 
Cy = {c,| yin Y} (2.10) 
[ 


forms a semigroup of functions on X. 


Example 2.4 (Nondecreasing functions.) 
Recall that in calculus, a function f : R — Ris nondecreasing if x < y implies 


Then the set of nondecreasing functions forms a semigroup of 


f(x) < fly). 


functions on R (Exercise 6). 
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Example 2.5 (Real shifts.) 
For each real number 1, define the shift by r to be the map 


op: ROR terse. 
Note that for real numbers r, s, and x, we have 
0, 00,(“%) =r+(s+2)=(rt+s) +2 =0745(2), 


SO 0, 005 = Or+4s5. Thus the set © of shifts forms a semigroup of functions on 
R. For a positive integer n and real number r, the equation 


Oo} = Onp (2.11) 


holds (Exercise 9). (] 


Example 2.6 (Computable functions.) 

Define a function f : N— N to be computable if there is a computer program 
producing f(n) as output whenever a natural number n is given as the input. 
Then if f and g are computable, so is their composite go f. In fact, given 
the input n, a program for go f could just take the output f(n) of a program 
for f, and feed this number f(n) as input to the program for g, obtaining 
the required output go f(n). Thus the set of computable functions forms a 
semigroup of functions on N. 


DEFINITION 2.7 (Identity function.) For any set X, the identity 
function idx is defined by idx : X ~ X;xren. 


Note that for sets X, Y and f : X — Y, we have 


DEFINITION 2.8 (Monoid of functions.) A set S of self-maps on a 
base set X is said to be a monoid of functions on X if it forms a semigroup, 
and if the identity function idx is an element of S. 


Clearly X* is a monoid on X. For a slightly less trivial example, note that 
the identity function idy on the set N of natural numbers is computable. Given 
a natural number n as input, consider the “lazy” program which immediately 
returns n as output. Thus the set of computable functions in Example 2.6 is 
a monoid of functions on N. 

For an element f of a monoid of functions on a set X, the power notation 
may be extended by setting f° = idx. 


FUNCTIONS 31 
Example 2.9 (Linear functions.) 


By (2.8), the set L(2, R) of linear functions from R3} to itself forms a semigroup 
of functions on R}. Now for the 2 x 2 identity matrix 


= |p) (2.13) 


the linear function L7, is the identity function idpx , so L(2, R) forms a monoid 


of functions on R3. l 


2.5 Injectivity and surjectivity 


A function f : X — Y is required to assign a unique function value f(x) in 
the codomain Y to each argument x from the domain X. On the other hand, 
it may happen that different arguments are assigned the same function value. 
For instance, with the squaring function sq: Z — N of (2.1), we have 


sq(—5) = (—5)? = 25 = 5? = sq(5). 
DEFINITION 2.10 (Injective function.) A function f : X > Y is 
said to be injective, or an injection, or “one-to-one,” if 
f(x) = f(a’) implies x=c' (2.14) 
for all elements x and x’ of the domain X. 
Expressing the injectivity condition (2.14) another way, the equation 
f(a)=y (2.15) 


is required to have a unique solution x in X for each element y of the image 
(2.4) of f. By default, any function with empty domain is injective. The 
restricted squaring function 


sq:N>N;n n? (2.16) 
is injective, while the original squaring function sq: Z — N is not. This shows 
why the domain is an integral part of the specification of a function. 

PROPOSITION 2.11 (Retracts of injective functions.) 


Let f : X — Y be injective, with nonempty domain. Then there is a function 


r:YoX (2.17) 
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such that 
rof=idx. (2.18) 


PROOF Pick an element zp of X. For a codomain element y that does 
not lie in the image f(X), define r(y) = xo. Now consider an element y of 
the image f(X). By the definition (2.4) of the image, the equation 


f@)=y 
has a solution. Since f is injective, the solution is unique. Define r(y) to 


be this unique solution element x,. We obtain a function r: Y — X. Now 
ro f:X — X. Then for each element x of X, we have 


ro f(x) =r(f(z)) = ap) = = idx(2), 
verifying (2.18). (] 


DEFINITION 2.12 (Retracts.) A functionr : Y — X is called a 
retract of a function f: X — Y ifro f =idx. 


PROPOSITION 2.13 (Functions with retracts are injective.) 
If a function f : X — Y has a retract, then it is injective. 


PROOF Letr:Y —X bea retract for f. Then 


f(z) = f(a’) implies 2 =ro f(x) =ro f(a’) =a" 


for x, x’ in X. J 


Proposition 2.11 shows that each injection with nonempty domain has a 
retract. Note that an injection f might have many retracts, because of the 
arbitrary choice of the element xg in the proof of the existence of the retract 
(Exercise 22). Also, note that the identity function idg on the empty set is 
its own retract. 


DEFINITION 2.14 (Surjective function.) A function f : X — Y 
is said to be surjective, or a surjection, or to map onto its codomain, if the 
codomain and image coincide: Y = f(X). 


Equivalently, a solution x to the equation f(x) = y of (2.15) exists for each 
element y of the codomain. In yet another formulation, the inverse image 


fy} = {x in X | f(x) =y} 
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is required to be nonempty for each element y of Y. Note that the only 
surjective function with an empty domain is the identity function idg on the 
empty set. The absolute value function abs : Z — N of (2.2) is surjective, 
while the absolute value function 


abs: Z — Zin |n| (2.19) 


is not, since the equation 
In| = —5 


has no solution n. This shows why the codomain is an integral part of the 
specification of a function. 


PROPOSITION 2.15 (Sections of surjective functions.) 
Let f: X — Y be surjective. Then there is a function 


s:Y xX (2.20) 


such that 
fos=idy. (2.21) 


PROOF If X is empty, then so is Y, and f is just the identity function idg. 
In this case, s = idg makes (2.21) work. Now suppose that X is nonempty. 
For an element y of Y, there is an element x of X such that f(x) = y. Choose 
the function value s(y) as one such element z,. Then a function s: Y — X 
is defined. For each element y of Y, we have 


fos(y) = f(s(y)) = f(y) =y = idy(y), 
verifying (2.21). (] 


DEFINITION 2.16 (Sections.) A functions: Y — X is called a 
section of a function f: X ~ Y if fos =idy. 


PROPOSITION 2.17 (Functions with sections are surjective.) 
If a function f : X — Y has a section, then it is surjective. 
PROOF Let s:Y —X bea section for f. Then 
f(s(y)) = fo s(y) = idy(y) =y 
for each element y of Y. l] 
Proposition 2.15 shows that each surjection has a section. Note that a 


surjection f might have many sections, because of the arbitrary choice of the 
elements x, in the proof of the existence of the section (Exercise 23). 
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2.6 Isomorphisms 


DEFINITION 2.18 (Bijective function, isomorphism of sets.) A 
function f : X — Y is said to be bijective, or an isomorphism (of sets), or a 
bijection, if it is both injective and surjective. 


PROPOSITION 2.19 (Inverses of isomorphisms.) 
Let f: X — Y be a byection. Then there is a function 


g:Y HX (2.22) 


such that 
gof=idx and fog=idy. (2.23) 


PROOF By Propositions 2.11 and 2.15, we have 
rof=idy and fos=idy. (2.24) 
By the associativity of function composition, we have 
r=roidy =rofos=idyos=s. 


Define g =r = s. Then (2.23) follows from (2.24). (] 


DEFINITION 2.20 (Inverse, invertible function.) For a function 
f:X - Y, a functiong: Y — X satisfying go f = idx and fog = idy 
is called an inverse of f. A function is described as invertible if it has an 
INVETSe. 


Example 2.21 (Natural logarithms and the exponential function.) 
Let (0,00) denote the set (open interval) of positive real numbers. Then the 
exponential function exp : R — (0,00); x + e* is invertible, with the natural 
logarithm function log : (0,00) — R as an inverse. The equations (2.23) are 
the familiar relationships 

loge” =a 


for real numbers x and 
eley — y 


for positive real numbers y. l] 


Proposition 2.19 shows that bijections are invertible. Conversely, an inverse 
g of an invertible function f is both a retraction and a section for f. Thus an 
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invertible function f is both injective and surjective. In other words, invertible 
functions are bijective. 


Example 2.22 (Inverses of real shifts.) 
For each real number r, the shift 0, of Example 2.5 has the shift o_, as an 
inverse. Thus the shifts are bijective. 


Recall that sections and retractions need not be unique. With inverses, the 
situation is different. 


PROPOSITION 2.23 (Uniqueness of inverses.) 


The inverse of an invertible function is unique. 


PROOF Let g: Y — X be an inverse of a function f: X — Y. Ifa 
function h: Y — X satisfies 


ho f =idx or foh=idy, 


then 
h=hoidy =hofog=idy og=g 
or 
g=goidy =gofoh=idxyoh=h 
respectively. In particular, g: Y — X is uniquely specified by (2.23). l] 


In view of Propositon 2.23, we can speak of the inverse f—! of an invertible 
function f. Note that 


(f)" =f 
for an invertible function f, so that inverses of invertible functions are again 
invertible. 
If there is an isomorphism f : X — Y from a set X to a set Y, we often 
write 


neay (2.25) 


and say that the sets X and Y are isomorphic. In this case Y = X, by virtue 
of the isomorphism f~'. We will often want to show that two sets X and Y 
are isomorphic. The standard technique for this is to exhibit two mutually 
inverse functions f: X — Y andg: Y — X. 


Example 2.24 

The set Z of integers is isomorphic with the subset N of natural numbers. 
Consider the function f : N — Z defined by setting f(0) = 0 and f(2r) =r, 
f(2r — 1) = —r for positive integers r. Consider the function g : Z — N 
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defined by g(n) = 2n for n > 0 and g(n) = 2|n| — 1 for n < 0. Then f and g 
are mutually inverse. l] 


The proof of the following proposition is left as Exercise 34. 


PROPOSITION 2.25 
Consider functions f: X — Y andg:Y — Z. 


(a) If f and g are injective, then so is go f. 
(b) If f and g are surjective, then so is go f. 
(c) If f and g are bijective, then so is go f. Moreover, (gof)~! = f-tog™t. 


Example 2.26 (Counting.) 
For each natural number n, consider the finite set 


n= {0,1,2,...,n—1} (2.26) 


of natural numbers less than n. Note that the set n has n elements. In 
particular, O is the empty set. Now if a finite set X has n elements, say 
X = {x0,21,.--,2%n—-1}, then there is a bijection 


kin Xin ag;. (2.27) 


Indeed, aset X has n elements if and only if there is a bijection k : n — X. We 
may say that k counts the elements of X. The number of elements in a finite 
set X is called the size or order of X. It is written as |X|. Proposition 2.25(c) 
may be used to show that two finite sets X and Y are isomorphic if and only 
if |X| =|Y| (Exercise 35). 


2.7 Groups of permutations 


DEFINITION 2.27 (Groups of permutations.) Let X be a set. 
(a) A bijective function f : X — X is called a permutation of the set X. 


(b) A set G of permutations on X is said to be a group of permutations of 
X or a permutation group on the set X if it is a monoid of functions 
satisfying the additional property 


f in G implies f~' in G. (2.28) 


The property (2.28) is known as closure under inversion. 
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Let X! be the set of all permutations of a given set X. According to 
Proposition 2.25(c), X! forms a semigroup of functions on X. Since idx is 
a permutation, X! forms a monoid of functions on X. Finally, the fact that 
inverses of invertible functions are invertible means that X! forms a group of 
permutations. This group X! is known as the symmetric group on X. (For a 
justification of the notation, see Exercise 44.) 


Example 2.28 (The group of real shifts.) 

The monoid Xz of shifts on R (compare Example 2.5 and Exercise 8) forms 
a group of permutations of R, since (,)~! = a_, as noted in Example 2.22. 
On the other hand, consider the set 


pas bea een 


of shifts by nonnegative real numbers r. This set does form a monoid of 
permutations on R. However, it does not form a permutation group on R, 
since it does not satisfy the property (2.28) of closure under inversion. 


Example 2.29 (The symmetric groups S,,.) 
For each natural number n, write S,, for the symmetric group n! on the set 
(2.26) of natural numbers less than n. The group S;, is called the symmetric 
group on n symbols. For a set {a1,a@2,...,a,} of distinct elements of n, the 
cycle 

(a1 ag... Ay) 


denotes the bijection 


n n;a@1! a2, 42 ' Q3,+++,Ar—1 7 Gr, Ar > QL 


with «+ « for elements x of n not included in the set {a1,a@2,...,a,}. It 
is conventional to write idy as the cycle (0). Two cycles (a1 az ... a) and 
(by bg ... bs) are said to be disjoint if the corresponding sets {a1,a2,...,a,} 
and {b;, b2,...,a@s} are disjoint (have no common element). Each permutation 
may be written as a product of mutually disjoint cycles. For example, 

0 38,1 12H7,3 6,4 8 6H0,7 2,84 


in Sg may be written as the product (0 3 6) 0 (2 7) 0 (4 8) of disjoint cycles. 
By following the effect of these functions, it is easy to express products of 
permutations as products of disjoint cycles. For example, 


(0 3 6) 0(27)0(48)0(04726)0(1 8) =(081 4 2) 0(36) 
since (0 3 6) 0 (2 7) 0 (4 8)0(047 2 6) 0 (1 8)(0) 
= (0 36)0(27)0(48)0(0 47 2 6)(0) 
= (0 36) 0 (2 7)0(4 8)(4) = (0 3 6) 0 (2 7)(8) =8 


and so on. (] 
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FIGURE 2.1: A regular n-gon. 


Example 2.30 (The cyclic groups C;,.) 
For each positive integer n, the cyclic group Cy, consists of the n permutations 


(0123... (n—2) (n—-1)), 
010342 =o Ga, 
O123e:2 G2) GAD) 52x 
.-, (0 (n—1) (n—2) ... 321), 


and (0) from S;,. These permutations correspond to the respective counter- 


clockwise rotations of a regular n-gon by the angles 
2 2 2 2 
an g=.. gt ae (n-))—, 0 
n n 


radians (Figure 2.1). (] 


Example 2.31 (The Klein 4-group.) 
The Klein 4-group V4 is the set 


{ (0), ( 1)(2 3), (0 2)(1 3), (0 3)(1 2) F 


of permutations. It forms a group of permutations of the set n for each natural 
number n > 4 (Exercise 40). 


The cycle notation is extended to denote permutations of arbitrary sets. 
For example, 


(AY 30 JO 79) 0 (KY 49) 0 (100 99 29) 


might denote a shuffle of the suit Y of hearts from (1.3). For an application 
of permutations to elementary cryptography, see Study Project 3 at the end 
of the chapter. 


2.8 


1. 


10. 


11. 


12. 


13. 
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Exercises 


Show that the empty set @ = { } cannot be the codomain of a function 
f :X — © with nonempty domain X. 


. Draw an arrow picture to illustrate all the functions 


f,9,h, gof, ho(gof), hog, (hog)of 


involved in the associative law (2.5). 


. Verify that (2.8) holds for all 2-dimensional real column vectors x. 


. Let m,n, and p be positive integers. Show that for an m x n real matrix 


A, there is a function RP? — R?; Xr AX. 


. Let X be a finite set with n elements. Show that the semigroup X~* 


has n” elements. 


. Show that the set of nondecreasing functions forms a monoid of functions 


on R. 


. A function f : R — R is said to be strictly increasing if x < y implies 


f(x) < f(y). Show that the set of strictly increasing functions forms a 
monoid of functions on R. 


. Show that the set © of shifts in Example 2.5 forms a monoid of functions 


on R. 


. Verify the equation (2.11), and show that it also holds for n = 0. (Hint: 


Consider using induction with n = 0 as the basis.) 


For each natural number n, define the power map 
Pn: RoR rea”. (2.29) 


Show that the set P of all power maps forms a monoid of functions on 
the set R. 


Let n be an integer such that na = c,(x) for all integers x. Show that 
n=0. 


Let Y be a subset of a set X. Under what conditions on X and Y does 
the set (2.10) of constant functions form a monoid of functions on X? 


A function f : R > R is said to be affine if there are real numbers m 
and c such that 
fitrem-a“te (2.30) 
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Show that the set A of all affine functions forms a monoid of functions on 
R. (In calculus, affine functions are often called “linear,” but in algebra 
it is best to reserve this term for the case c = 0.) 


A function f : R > R is said to be a polynomial function if there is a 
natural number n and real numbers fo, fi,..-, fn such that 


f() = fra” t+...+ fiat fo 
for x in R. 


(a) Show that the set of all polynomial functions forms a monoid of 
functions on R. 


(b) Show that there is a function f : R — R which is not a polynomial 
function. 


Show that the set C(R) of all continuous functions f : R — R forms a 
monoid of functions on R. 


Let r be a positive integer. Let C’(R) denote the set of all functions 
f :R- R for which the r-th derivative f"(x) exists at each real number 
x. Show that C’(R) forms a monoid of functions on R. 


Let X be an infinite set. A function f : X — X is said to be almost 
identical if the set 


{cE X|a F f(x)} 


of elements x of X, differing from their image f(a) under f, is finite. 
Let F be the subset of X* consisting of the almost identical functions. 
Show that F is a monoid of functions. 


Show that the power map p,, of (2.29) is injective if and only if n is odd. 


Show that the power map p, of (2.29) is surjective if and only if n is 
odd. 


Show that sections are injective. 
Show that retracts are surjective. 
Show that the injection (2.16) has infinitely many retracts. 
Show that the surjection (2.2) has infinitely many sections. 
Consider the function 

f: ROR awe a(ae—-1)(e4+1). 


(a) Show that f is not injective. 
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(b) Using the Intermediate Value Theorem or otherwise, show that f 
is surjective. 


25. Consider the function 
f: RAR tre’. 
(a) Show that f is injective. 
(b) Is the natural logarithm function a retraction for f? 
(c) Show that f is not surjective. 
26. Let f : X — X be a function with finite domain X. Show that the 
following three conditions are equivalent: 
(a) f is injective; 
(b) f is surjective; 
(c) f is bijective. 


27. Consider the 2 x 2 real matrix 
ab 
re F i | 
Show that the following three conditions are equivalent: 
(a) The linear function L, is injective; 
(b) ad— bc £0; 
(c) The linear function L, is surjective. 
28. Let f : X — Y be a function from a finite set X with m elements to 
a finite set Y with n elements. If m > n, show that f is not injective. 


(This is known as the Pigeonhole Principle: If m pigeons occupy n holes, 
and m > n, then at least two pigeons have to share.) 


29. Show that the set R of real numbers is isomorphic to its proper subset 
(0,00) of positive real numbers. (Compare Example 2.21.) 


30. Show that a finite set X cannot be isomorphic to a proper subset Y of 
X. (Recall that a subset Y of a set X is proper if there is an element x 
of X that does not lie in Y.) 


31. Let f: X — Y;x4+ f(x) be a function. 


(a) Show that there is a unique subset Y’ of Y such that the minimal 
corestriction 


X 3 Y';2- f(x) (2.31) 


is a surjective function. 
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(b) What is the minimal corestriction of the absolute value function 
(2.19)? 


32. Let f: X — Y;a f(x) be a function. 
(a) Show that there is a subset X’ of X such that the function 
X'S Y;2 f(z) (2.32) 


is an injective function. 


(b) Give an example to show that the subset X’ need not be unique. 


33. Let f : X — Y bea function with nonempty domain X. Show that 
there is a function g: Y — X such that f = fogof. 


34. Prove Proposition 2.25. 
35. Let X and Y be finite sets. Show that X = Y if and only if |X| =|Y]. 


36. Let X be a set. Suppose that a nonempty semigroup G of permutations 
of X satisfies the property (2.28) of closure under inversion. 


(a) Show that idx lies in G. 
(b) Conclude that G is a permutation group on X. 


37. Express 
(072 1)0(3 45 6)0(0 62 4)0(3157) 


as a product of disjoint cycles. 


38. Let @ and 


Q= (a1 XQ... Lp-1 Lr) 


be permutations of a finite set X. Show that 
BoacB* =( B(x) B(x2) ... B(tr—-1) B(ar)). 


39. Let n be a positive integer. Show that S,, has n! elements: To specify 
a permutation a of n, there are n choices for a(0), then n — 1 choices 
for a(1) (avoiding a(0)), then n — 2 choices for a(2) (avoiding a(0) and 
a(1)), and so on. 

40. Show that V4 forms a group of permutations of each set n with n > 4. 


41. Show that for distinct elements a1, a2,...,@,—1,a, of the set n, 


(@, Qq ... Ap—1 Gy) = (Ag ag ... Gp Ay). 


42. 


43. 


44. 


45. 


46. 


47. 
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Show that for distinct elements a1, a2,...,@,;—1, a, of the set n, 
(de Ge sa. Ope) SS (GaAs 30k Os 215 


Show that 


(ay QQ ... Ar-1 ay) O° (by bo eats bs—1 bs) 


= (by by eee bs—1 bs) (e) (ay ag... Ar—-1 ay) 
for disjoint cycles (a; a2 ... a,) and (b; bg ... 0g). 


Let X be a finite set with n elements. Show that the symmetric group 
X! has n! elements. 


Let Aff(R) be the set of all affine functions (2.30) with m 4 0 (compare 
Exercise 13). Show that Aff(R) forms a group of permutations of R. 


Suppose that a group G of permutations of R contains the real shifts oq 
and oy» for real numbers a and b. 


(a) Show that G contains Oma for each positive integer m. 
(b) Show that G contains oma for each integer m. 


(c) Show that G contains o¢+ns for each integral linear combination 
ma-+nb of a and b. 


Suppose that a group G of permutations of R contains the real shifts a9 
and os. Show that G contains o,, for each integer n. 


Study projects 


1. Not all functions are computable. Consider a program to compute 


a certain function N — N. This program, as a list of instructions in a 
certain programming language, can ultimately be written out as a long 
but finite string of binary (base 2) digits: 


100011001011100001111000100100100100101000000. . 011001001 


Add a 1 to the left of each such string. (Why do we need to do this?) 
The number that is represented to base 2 by the augmented string — 
compare Exercise 19 of Chapter 1 — is called the Godel number of the 
program. List all possible such programs, for all computable functions 
N —N, in increasing order of Gédel numbers. Suppose that the list is 
Po, Pi, Po,.... Suppose that for natural numbers m and n, the program 
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Py, outputs the function value pm» when given input n. Now show that 
the function 
fiNON a poe +1 


is not computable, since for each natural number n, the function value 
f(”) = pan +1 differs from the function value p,,, computed by P,, when 
applied to the argument n. (The technique used here to construct f is 
known as Cantor diagonalization.) 


. Symmetries of the regular tetrahedron. Consider the solid regular 


tetrahedron: 


(a) Show that the rotations by 120° and 240° about an axis through 
the vertex 0 and the midpoint of the triangular face 123 implement 
the permutations (123) and (132) of the vertices. 


(b) Show that the symmetries of the regular tetrahedron include all 
the permutations 


(123), (023), (013), (012), 
i \ (2.33) 


(c) Show that the full set of symmetries of the tetrahedron consists of 
the union of the set (2.33) and the Klein 4-group V4. 


(d) Determine a geometric or combinatorial rule that can decide when 
a product of two elements of (2.33) will lie in V4. 


(e) Look up the structure of a methane molecule CH,. How do its 
symmetries relate to the symmetries of the regular tetrahedron? 


3. Cryptography is the art of designing secret codes. An original text, 


the cleartext, is transformed by a bijection c into an encoded text for 
transmission across some channel that may be prone to eavesdropping. 
The person for whom the message is intended then applies the inverse 
bijection d = c~+ to decode the encoded text. 
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Among the most elementary codes are those given by a permutation of 
the alphabet, for example the “keyboard permutation” 


c = (QWERTYUIOP) 0 (ASDFGHJKL) 0 (ZXCVBNM). (2.34) 
(a) Apply the keyboard permutation (2.34) to encode the cleartext 


SEND MORE MONEY. 


(b) Suppose that the keyboard permutation was used to produce the 
coded message 
JPDYSHRD GTRRF. 


What was the original cleartext? 


4. Cryptanalysis is the eavesdropper’s art of breaking secret codes, of 
reading encoded messages without explicitly being given details of the 
bijection c used to encode them. Codes given by a permutation c such as 
the keyboard permutation (2.34) are quite easy to break, using the fact 
that various letters and letter combinations in the English language have 
different frequencies. For example, the commonest letters in decreasing 
order of frequency are 


PP AO EN ie cae (2.35) 


while combinations such as TH arise often, and Q is almost always 
followed by U. Sometimes, knowledge of the context of the message can 
be used. For example, if the message concerns Persian Gulf countries, 
like Iraq or Qatar, then the letter following a Q might not be a U. 


Suppose that an eavesdropper intercepts the message 


CZYSY ABEE FY UKCDZYQ TO SKBR 


2.36 
TS OTL BR UKSCQ TO CZY YKQC 2:50) 


encoded by a permutation c. 


(a) Order the letters in the message (2.36) according to their relative 
frequency. 


(b) Compare this ordered list of letters with (2.35), and make an initial 
partial guess at the decoding bijection d = c7!. 

(c) Now, assuming that the message (2.36) consists of ordinary English 
words, try to decode the message as completely as you can. 

(d) If you cannot make sense of the message, try a different assignment 
of the commonest letters. Also, be aware of the few possibilities for 
two-letter words in the English language. Try using the fact that 
TH is a common pair of letters. 
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In order to remove the extra information given by the spaces between 
words, a secret message such as (2.36) might be transmitted in a series 
of 5-letter groups such as 


CZYSY ABEEF YUKCD ZYQTO SKBRT 
SOTLB RUKSC QTOCZ YYKQC 


instead. 


2.10 Notes 
Section 2.1 


Various sources use various names in place of our domain/codomain/image 
terminology. For instance, the terms “target” and “range” appear frequently, 
the latter in all three possible roles: 


e The range is the place where one goes to shoot bullets, so the “range” 
is the domain, the place from which the arguments shoot out; 


e The codomain is the set over which the function values “range” ; 
e The image is the exact set over which the function values “range.” 


We hope that our terms are unambiguous. 


Section 2.5 


Strictly speaking, the proof of Proposition 2.15 uses the Axiom of Choice 
to select the elements z,. In fact, the existence of a section to each surjective 
function is one of the many equivalent formulations of the Axiom. 


Section 2.7 


For infinite sets X, some authors reserve the term “permutation” to denote 
bijections f : X — X that are almost identical in the sense of Exercise 17. 
C.F. Klein was a German mathematician who lived from 1849 to 1925. 


Section 2.9 


K. Godel was an Austrian (Moravian) logician and mathematician, later 
emigrating to the United States, who lived from 1906 to 1978. G. Cantor was 
a German mathematician who lived from 1845 to 1918. 
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2.11 Summary 


In algebra, the notation for a function is more precise than the notation 
commonly used in calculus. Associated with a function 


fiXY¥;04 f(x) 


are a number of key terms: 
e The function assigns a unique value f(x) ... 
e ...to each argument 7; 
e The domain X is the set from which the arguments are taken; 


e The codomain Y is the set in which the function values are expected 
to lie; 


e The image is the set 
F(X) = {f(@) | @ in X} 
of actual values which occur; 


e For each element y of the codomain Y, the inverse image of y is the 
set 
fy} = {x in X | f(x) =y} (2.37) 
of arguments x that are assigned the function value y. 
Note that the inverse image sets (2.37) exist for all functions f, regardless 
of whether f is invertible or not. If f does happen to be invertible, then 
f-t{y} = {f(y}, so the notation is consistent in that case. 
At a more basic level, it is important to distinguish between elements, sets, 
and functions. An equation may say that two elements are equal: 


idx (a) =a 
or that two sets are equal: 
sq(R) ={s € R|s>0} 
or that two functions are equal: 
abs = sqrt o sq. (2.38) 


Do not write equations with a function on one side, and an element on the 
other. For example, “ idx = x ” would be meaningless, since the left-hand 


a“ 
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side is a function, while the right-hand side is an element. In calculus, one 
might see an equation of the form 


jz] = Var, 


and this equation could be ambiguous. It could be expressing the equation 
(2.38) between functions, or it could be an expression of the equality between 
two real numbers, say if x had been specified as a certain real number. The 
notation in algebra is carefully designed to avoid this kind of ambiguity, and 
to make mathematical reasoning more transparent. 

Added care is required when discussing a semigroup G of functions or group 
G of permutations on a set X. In this case, we must distinguish between 
the elements of the base set X, and the elements of the semigroup G. The 
semigroup G' is a set whose elements are functions, of the form f: X — X. 


Chapter 3 


EQUIVALENCE 


When we do mathematics, we study the structures that underlie the various 
phenomena encountered in the world. For this to work, mathematics has to be 
able to filter out all the detail that is not relevant to the particular structure 
being studied. Equivalence is the filter. 

The most basic example is the concept of number. What does the number 
3 stand for? A set X has 3 elements if and only if there is a set isomorphism 


PAD) (3.1) 


counting off the elements of X as f(1), f(2), and f(3). The function f has to 
be injective, so that no element of X gets counted twice. The function f has 
to be surjective, to make sure that each element of X gets counted. 

The only problem here is the circularity. To characterize the number 3, we 
have used that number in the domain of the function (3.1). To escape the 
circularity, we can decide to consider two sets as equivalent for the purposes 
of counting whenever they are isomorphic. The number 3 then emerges as the 
property which is common to each of the sets that are isomorphic to some 
given 3-element set (for instance {1,2,3} or {9,{@}, {{@}}}). The particular 
details of the elements in the sets are not relevant to the problem of counting. 
They are filtered out by the equivalence. 

Equivalence relations play a key role in the analysis of general functions. 
Each function determines an equivalence relation on its domain, identifying 
two elements whenever they have the same function value. Conversely, it 
transpires that every equivalence relation is of this type. 


3.1 Kernel and equivalence relations 
Consider the squaring function sq : Z > Z;n + n? of (2.3). For two 
integers n and n’, 


/ 


sq(n) =sq(n’) if and only if n’=-+n. 


In other words, the integers n and n’ are assigned the same function value 
if and only if they both lie in the same equivalence class {r,—r}. These 
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equivalence classes partition the domain set Z of integers, meaning that Z 
decomposes as the union 


Z = {OJ} U {41} U {42} U{H3} vu... (3.2) 


of mutually disjoint subsets, the equivalence classes. 


DEFINITION 3.1 (Kernel relation of a function.) Consider a 
function f : X > Y. A pair (x,2x') of elements of X is said to be in the 
kernel relation ker f, and we write x ker f x’ or x (ker f) 2’, if and only if 
x and x’ are assigned the same function value by f. Formally, 


x (ker f) a’ if and only if Onda e (3.3) 
Previously, we studied order relations such as < on R or the divisibility 
relation | on N. These order relations had the properties of being reflexive, 


transitive, and antisymmetric. Now the kernel relation ker f of a function 
f:X —Y is certainly reflexive: 


x (ker f) x 
for all x in X. It is also transitive: 
(a (ker f) x’ and a’ (ker f) x” ) implies x (ker f) x”, 


since f(a) = f(a’) and f(x’) = f(x”) imply f(x) = f(a”). The third property 
of the kernel relation is called symmetry: 


x (ker f) 2’ implies x’ (ker f) x. 


These properties of kernel relations are formalized in the important concept 
of an equivalence relation. (Remember RST in alphabetical order!) 


DEFINITION 3.2 (Equivalence.) Let R be a relation on a set X. 
(R) The relation R is reflexive if x Rx for each element x of X. 


(S) The relation R is symmetric if for elements x and x’ of X, the relation 
x R2' implies x’ Rx. 


(T) The relation R is transitive if for elements x, x’, and x” of X, the 
relations x Ra’ anda’ Rx" implyx Rx". 


Finally, the relation R is an equivalence relation on X if it satisfies all three 
conditions (R), (S), and (T). 


PROPOSITION 3.3 (Kernels are equivalence relations.) 
Let f: X — Y be a function. Then the kernel relation ker f of f, specified 
by (3.3), ts an equivalence relation on the domain X of the function f. 
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3.2 Equivalence classes 


The kernel of the squaring function sq : Z — Z yielded the partition (3.2) 
of Z. Each equivalence relation on a set yields a partition of the set. 


DEFINITION 3.4 (Equivalence class.) If R is an equivalence relation 
on a set X, define the equivalence class of x under R to be the set 


[z4]p ={a' in X |x Ra'} (3.4) 


of all elements x’ of X that are related to x by R. 


Note that each class [2] r is nonempty, since by reflexivity it at least contains 
the element x itself. For the kernel relation ker f of a function f: X — Y, 
and for an element x of the domain X, the equivalence classes are given by 
the inverse image sets 


[t]ker ¢ = f~'{f(x)}- (3.5) 


Here is the key partitioning property of equivalence relations. 


PROPOSITION 3.5 (Equivalence classes are disjoint or equal.) 
Let R be an equivalence relation on a set X. Let x, and x2 be elements of 
X. Then the two equivalence classes [x1] Rr, |t2]R are either disjoint: 


[xilr M [xa] = 


or else equal: [1]r = [x2|r. In the latter case, x1 R x2. 


PROOF Suppose that [zi]r and [x2]r are not disjoint, so they have a 
common element x’. Then 2; R wx’ and x2 R x’ by the definition (3.4) of the 
equivalence classes. By symmetry, 2’ R x. Then x, Rv’ and 2’ R x2 imply 
x1 R x2 by transitivity. 

Now suppose that 2” is an element of [71], so that 2; Rx”. Then 


" 
wmRaRe 


implies x2 R «” by transitivity, so that x” is an element of [xa]r. Similarly, 
each element of [xv2]r is an element of [xi]. It follows that the two classes 
[vi]r and [x2]r are equal. 


To conclude this section, we will show that each equivalence relation R on 
a set X is the kernel relation of a suitable function with X as domain. Let 
XR denote the set 


{[z]r | x in x} 
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of all equivalence classes under R. It is very important to note that Xp is a 
set of sets: The elements C' of the set Xp are themselves sets (the equivalence 
classes). One of the main difficulties in understanding algebra arises if the 
different levels of the hierarchy 


elements — sets — sets of sets 


are confused. 


PROPOSITION 3.6 (Equivalence relations are kernels.) 
Let R be an equivalence relation on a set X. 


(a) There is a surjective function 


nat R: X > Xp x [aI)r. 


(b) The kernel relation of the function nat R is R itself. 


PROOF Part (a) is immediate. For part (b), note that two elements 11, x2 
of X are related by the kernel relation of nat R if and only if [xi)r = [r2]R. 
By Proposition 3.5, the latter condition holds if and only if x; and x2 are 
related by R. 


By Proposition 2.15, the surjective function nat R: X — Xp has a section 
rep : Xr — X, with rep(C) as an element of C for each equivalence class 
C in Xp. The element rep(C) is called a representative for the equivalence 
class C’. Each equivalence class C' may be written as the class 


C = |rep(C)|r 


of its chosen representative. Sometimes, to avoid having to consider the set 
Xp of sets, it is convenient to consider the image set rep(X,R) instead, the 
set of representative elements. Note that the sets Xp and rep(XR) are iso- 
morphic, by virtue of the mutually inverse functions rep : Xr — rep(Xpr) 
and rep(Xr) > Xz xr [a]R. 


Example 3.7 (Choosing representatives.) 

Let X be the set of citizens. Suppose that two citizens are related by the 
equivalence relation R if and only if they belong to the same congressional 
district (riding, parliamentary constituency, ...). Then as a representative 
rep(C) for an equivalence class C, one may choose the congressional repre- 
sentative (Member of Parliament, ...) for that district. Of course this choice 
of representative is not unique, and may change after an election! 
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3.3 Rational numbers 


4 3 2 4 
e e e e 6 e e e 
1 
e e e e 5 e e e e 
4 
5 
e e e 4 e 
2 
3 
e e e 3 e e e 1 
2 
e e 2 e 1 
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e e 1 . z 
4 3 2 1 
- e © e © © e © e-> (0) 
0 1 2 3 4 5 
e —1 e e e 
—2 e e e 
aq 
e e —3 e e e e 2 
e —-4 e e e e 
e e e e —5 e e e e 
—1 


FIGURE 3.1: Rational numbers. 


Rational numbers provide a good example of the use of equivalence classes 
in algebra. Consider the set 


X ={(n,m)|n,m in Z, n4 0} 
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of pairs of integers, the first integer of each pair being nonzero. The set X is 
illustrated by the solid dots in Figure 3.1. Define a relation R on X by 


(n1,m1) R (n2,m2) if and only if MyNg = MN, . (3.6) 


PROPOSITION 3.8 


The relation R is an equivalence relation on X. 


PROOF The conditions required by Definition 3.2 have to be verified. The 
reflexivity (R) and symmetry (S) are immediate. To verify the transitivity, 
suppose 

(n1,™1) R (n2,mM2) R (ng,ms3) (3.7) 


for elements (n;,m,;) of X (with 1 <i <3). Then 
Mmyng= men, and mgn3 = mMm3N2. 
Multiplying these two equations together yields 
MyNgMgn3 = M3N2gM2N1 . 
Since ng is a nonzero integer, it can be canceled to yield 
MiM2n3z = M3M2Nq . 
If mg is nonzero, it can also be canceled to give the equation 
Myn3g = M3ny4 (3.8) 


showing that (n1,™m1) R (n3,m3). If m2 = 0, then (3.7) shows minz = 0 and 
m3n2 = 0. Since ng is nonzero, these latter equations give m; = 0 = m3. The 
equation (3.8) then holds trivially, so again (n1,m1) R (n3,m3). (] 


In Figure 3.1, the equivalence classes for R are the sets of solid dots lying on 
the same line through the origin. (A few of these lines have been drawn.) The 
set Q of rational numbers is defined to be the set Xp of equivalence classes. 
For an element (n,m) of X, write m/n or 


m 

m= [(n,mle 
for the corresponding equivalence class. If n = 1, the class is often written 
just as m instead of m/1. If m = 0, the class is often written just as 0 
instead of 0/n. For nonzero rationals m/n, a preferred representative is the 
one in lowest terms, meaning n > 0 and gcd(n,m) = 1. Thinking of the 
rational as a line through the origin in Figure 3.1, this representative is the 
first dot encountered on proceeding along the line to the right from the origin 
(Exercise 9). 
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The set Q of rational numbers has a well defined multiplication given by 


my, Mg mms 


hee — ane (3.9) 
It has a well-defined addition given by 
my, +: mg, = MyN2 + MeN, ; (3.10) 


ny n2 nynNg2 


The multiplication (3.9) and addition (3.10) appear to depend on the choice 
of the particular representatives (n1,7™m1) and (n2,m2). However, saying that 
these operations are “well defined” means that the same answer is obtained in 
each case, regardless of the particular choice of representative. For example, if 
m/ny = mi,/ni, and m2/nz = m/n5, the definition (3.6) of R gives min}, = 
min, and man, = mbnz. Then mymenins = mimyn Ne, so 


mim, mms 


nyNg nin? 
showing that the multiplication (3.9) is well defined. Verification that the 
addition (3.10) is well defined is assigned as Exercise 7. 

The equivalence relation R on the set X actually arises as the kernel relation 
of the division function 


\:X 3R;(n,m)% n7'm (3.11) 


(Exercise 8). The set Q of rationals is usually identified with the image of 
the function (3.11), embedded in the codomain R. This is possible because 
of the isomorphism between the set Xp of kernel classes and the image of 
(3.11). In the next section, the First Isomorphism Theorem will provide a 
comparable isomorphism for every function. To conclude this section, we show 
that the function (3.11) is not surjective, proving the existence of irrational 
real numbers, real numbers that are not expressible in the form n~!m with 
(n,m) in X. 


THEOREM 3.9 


Irrational numbers exist. In particular, /2 is not rational. 


PROOF Suppose that V2 is rational, say m/n. Then (m/n)? = 2, so we 
obtain the equation 

m? = 2n? 
between positive integers. However, there can be no such equation, since 
it would violate Theorem 1.14 (page 15). Indeed, Theorem 1.13 yields a 


factorization 
e,e er 
Mm = 2° po”... py 
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of m and a factorization 
n= pf... pf 


of n. Then the factorization of m? contains the even number 2e of prime 


factors 2, while the factorization of the same number m? in the form 2n? 
contains the odd number 1+ 2f of prime factors 2. This contradicts the 
uniqueness part of the Fundamental Theorem of Arithmetic. 


3.4 The First Isomorphism Theorem for Sets 


The division function (3.11) decomposes as a composite of the surjection 
X — Xp, the isomorphism Xp = Q, and the injection Q — R. The topic 
of this section, the First Isomorphism Theorem for Sets, shows that every 
function can be written as a composition 


(injection) o (isomorphism) o (surjection) . 


Consider a function f : X — Y. Since the kernel relation ker f is an equiva- 
lence relation, Proposition 3.6(a) shows that there is a surjective function 


si X — Xker f} t+ [L]ker f- (3.12) 
On the other hand, there is an injection 
Gi HX)CVyry (3.13) 


inserting the image f(X) as a subset into the codomain Y. The remaining 
ingredient is an isomorphism between the set Xjer ¢ of kernel classes and the 


image f(X). 


PROPOSITION 3.10 
Let f: X — Y be a function. Then there is a well-defined bijection 


b: Xtor¢ — F(X); eho s+ f(a) (3.14) 


PROOF § It will first be shown that b is a well-defined injection. Note that 
well-definedness is an issue, since the specification (3.14) apparently depends 
on the choice of the representative x for the kernel class [z]ker ¢. However, for 
elements x and x’ of X, we have 


[t]ker f = [2"]ker ¢ if and only if xkerf x’ ifandonlyif f(x) = f(z’) 


by Proposition 3.5 and the definition (3.3) of the kernel relation. Reading 
in the “only if” direction shows that b is well defined. Reading in the “if” 
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direction shows that b is injective. Finally, it is immediate that b is surjective, 
by the definition (2.4) of the image f(X). l 


We now obtain the First Isomorphism Theorem for Sets. (Later, it will be 
embellished with additional algebraic structure.) 


THEOREM 3.11 (First Isomorphism Theorem for Sets.) 
Let f: X — Y bea function. Then f decomposes as the composite 


f=jobos 


of the surjection s of (3.12), the bijection b of (3.14), and the injection j of 
(3.13). 
The theorem is summarized in the following diagram, of the kind proposed 


in Section 2.2: 


er nD 


| Ie 


Xker f eres F(X) 


Figure 3.2 presents a more naive illustration that may nevertheless be helpful. 


An illustration of the First Isomorphism Theorem. 


FIGURE 3.2: 
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The domain X is the set of cards in a hand. The codomain Y is the full 
set of suits. The function f maps each card in the hand to its suit, so two 
cards are in the relation ker f if and only if they lie in the same suit. The 
equivalence class 


[QU] ker f = {JY, QY, AY} 


consists of all the hearts in the hand, the class 
[KO ]ker f = {10%, KO} 


consists of all the diamonds in the hand, and the class [3é|xer ¢ contains the 
unique club in the hand. The image 


f(X) = {9,0, do} 


is the set of suits appearing in the hand. The First Isomorphism Theorem 
exhibits this set as isomorphic to the set 


Xker f _ {[QU]ker f [AO ner fs [34] ker rt 


of equivalence classes. Indeed, both f(X) and Xxey ¢ each have 3 elements. 
The fact that the 3 elements of the set Xxer ¢ are each themselves sets is 
irrelevant here. When dealing with sets of equivalence classes, you disregard 
the internal details of the classes for a moment, and just consider each class 
as an element. 


3.5 Modular arithmetic 
Fix a positive integer d. For each integer a, define amod d by 
a=qd+(amodd). (3.15) 


In other words, amodd is the remainder given by the Division Algorithm 
with a as dividend and d as divisor. (Compare Proposition 1.6, page 7.) 
Consider the function 


f:Z—-N;aramodd. (3.16) 


The kernel classes [a] moaa of this function are known as congruence classes 
modulo d. Two integers a and 0 are said to be congruent modulo d, written 


a=b modd, (3.17) 


if they are related by the kernel relation ker f, or (equivalently) if they lie in 
the same congruence class, or if they leave the same remainder after division 
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by d. To facilitate working with (3.17), it is helpful to summarize yet more 
equivalent forms of the relation. 


PROPOSITION 3.12 (Characterizations of congruence.) 
Let d be a positive integer. For integers a and b, the following are equivalent: 
(a) a=b mod d; 

(b) d divides a — b; 

(c) a—b is a multiple of d. 


PROOF The equivalence of (b) and (c) is (1.9). Now if (c) holds, say 
a—b=rd for some integer r, we have 


b=a-—rd=qd+ (amodd) — rd = (q—r)d+ (amod d), 


using (3.15). It follows that (b mod d) = (amod d), so (a) holds. Conversely, 
suppose that (a) holds, say a = qd+(amod d) and b = q’d+(amodd). Then 
a—b=qd—qd=(q—()d, so (c) holds. ( 


The bijection b of the First Isomorphism Theorem provides an isomorphism 
Zmodad — Z/a; [a] modd +> amod d (3.18) 
between the set of congruence classes modulo d and the set 
Z/a= {0,1,2,...,d—1} 


of remainders or integers modulo d, the image of the function (3.16). The 
isomorphism is often used to identify a congruence class with its representative 
remainder, so the set of congruence classes is then written as Z/q. 

For d = 2, the set Z/2 = {0,1} consists of the two bits or binary digits 0 
and 1. The remainder 0 stands for the set of even integers, while 1 stands for 
the set of odd integers. Now these sets of odd and even integers follow simple 
arithmetical rules, summarized by the tables in Figure 3.3. The left-hand 
table means that the sum of two even integers is even, the sum of an odd and 


+ /0 1 - 10 1 
0|;0 1 0 |0 0 
1 }|1 0 1 }|0 1 


FIGURE 3.3: Addition and multiplication modulo 2. 
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an even integer is odd, and the sum of two odd integers is even. The right- 
hand table means that the product of two even integers is even, the product 
of an odd and an even integer is even, and the product of two odd integers is 
odd. Similar modular arithmetic holds for general positive divisors d. 


PROPOSITION 3.13 

Let d be a positive integer. Suppose that for integers a; and b; (with i = 1,2), 
a,=b, modd and ag=bo modd. (3.19) 

Then 


ay +ag= by + bo mod d and ay: ag= by < bg mod d. (3.20) 


PROOF By Proposition 3.12 and (3.19), there are integers r; and rg with 


ay = by = ryd and ag — bg = rod. 


(ay t a2) (by t be) => (ay _ by) + (a2 —_ bz) = ryd+ Tod => (rz + ro)d 


and 
ayag = (by + r1d)(b2 + rod) 
=> by be + (r1b2 + bire + ryra)d, 
so Proposition 3.12 yields the required relations (3.20). ( 


COROLLARY 3.14 
There are well-defined operations 


[a] moa a + [b] moda = [a+ 5] moaa (3.21) 


and 
[a] modd‘° [0] modd = |a° b mod d (3.22) 


on the sets Zmoaa and Z/q. 


Note that for each element a of Z/q, the function 
ZLla—> Z/laseeatex 
is the permutation 
((0 mod d) (1modd) (2modd) ... (—1mod d))" 


of the cyclic group Cy (compare Example 2.30). 


3.6 


1. 


11. 


12. 
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Exercises 


Let X be a set of sets. Show that isomorphism between members of X 
is an equivalence relation on X. 


. For the functions sq of (2.1) and abs of (2.2), show that 


ker sq = ker abs. 


. Verify (3.5). 


. Let f : X — Y bea function. Show that the following two conditions 


are equivalent: 


(a) f is injective; 


(b) For each element x of X, the equivalence class [1]ker ¢ has only one 
element. 


. Show that the set of natural numbers may be chosen as a particular set 


of representatives for the equivalence classes of the equal kernel relations 
of Exercise 2. 


. Show that the set of nonpositive numbers may be chosen as a set of 


representatives for the equivalence classes of the equal kernel relations 
of Exercise 2. 


. Verify that the addition (3.10) of rationals is well defined. 


(a) Show that the relation R of (3.6) on the set X is the kernel relation 
of the division function (3.11). 


(b) Conclude directly that R is an equivalence relation, without using 
Proposition 3.8. 


. Suppose that n is a positive integer and m is a nonzero integer. Show 


that gcd(n,m) = 1 if and only if there is no element of the set X lying 
on the interior of the line segment from the origin to (n,m). 


. Show that 3 is irrational. 


Show that ~/2 is irrational. 


Deal yourself a hand of cards, and use the First Isomorphism Theorem to 
analyze the function mapping each card in the hand to its suit. Identify 
the set of equivalence classes for the kernel relation of the function. 
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14. 


15. 


16. 


17. 
18. 


19. 


20. 


21. 
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. Let a,b, and d be positive integers. Show that 
a=b modd 


if and only if a and b have the same rightmost digit in their base d 
representation (compare Exercise 19 in Chapter 1). 


Show that a positive integer n cannot be a perfect square if its decimal 
representation ends in one of the digits 2, 3, 7, or 8. 


Consider the numbers 
123 4 5 67 8 9 10. 


Is it possible to put positive or negative signs in front of each, so that 
the total sum of the signed numbers is zero? 


Repeat Exercise 15, this time with the numbers 


123 4 5 6 7 8 9 10 Ii. 


Show that log, 3 is irrational. 


Consider an angle 6 with 


T 
0<d<-. 
< $5 


Suppose that 
l 
cos#@=— and sind= ue 
n n 


are rational numbers, with positive integers 1, m, and n. Show that 1 
and m cannot both be odd numbers. 


Which of the following three conditions determines the kernel relation 
R of the cosine function cos : R — R; a+ cos x? 


(a) a Ry ifandonlyif r=+y. 


(b) « Ry ifandonly if «=2mn+y for some integer n. 
(c) «Ry ifandonly if «—y=2nn for some integer n. 


Define a relation P on the set R of real numbers by 
xzPy if and only if x—y=2nn for some integer n. 
Show directly that P is an equivalence relation on R. 


Show that the relation P of Exercise 20 is the kernel relation of the 
column vector-valued function 


f: RoR 0K fe 


sin 0 


Conclude that P is an equivalence relation on R. 
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3.7 Study projects 


1. Tonal music. Consider the set X of frequencies of audible sounds, 
measured in Hertz (or cycles per second). Define a relation R on X by 
fi R fo if and only if 


fi/fe is an integral power .. I DIY ON Oe On wot Os 


(a) Show that R is an equivalence relation on the set X. 


Ass Auto A220 Aso Aggo A 


Aoz.5 1760 A3520 


FIGURE 3.4: A piano keyboard. 


(b) If the note A (compare Figure 3.4) labels the equivalence class 
_..,55, 110, 220, 440, 880, 1760,... , 


describe how other notes also correspond to equivalence classes. 
(c) Explain why the equivalence classes are more important in music 
than the actual frequencies themselves. 


2. Continued fractions. Theorem 3.9 shows that there are real numbers 
s, for example V2, that are not rational. In Figure 3.1, this means that 
the line of slope s through the origin moves off to infinity on the right 
without ever exactly hitting a solid dot (n,m). Nevertheless, the line 
does come very close to various dots along the way. For example, the 
line of slope 2 only narrowly misses (2,3), (5,7), (12,17), (29, 41), ete. 
Which dots are close to a line of irrational slope s? How can we find 
good rational approximations to real numbers like e and 1? 


The answer is given by continued fractions. A continued fraction is an 
expression of the form 


Lo t+ 


a+ 
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or, equivalently, 


wtf (avi /(mrt/( -+1/(tn- 1+1f29)--))). 


Since these expressions are extremely unwieldy, they are rewritten as 
[[zo, UP, oe ples Gall ‘ (3.23) 


In other words, [[zo]] = Xp and 


1 
[[0, 15+, 0a, ei |] = [0.2 ve DET | (3.24) 
Li+1 


for natural numbers i. In order to recover a fraction from (3.23), define 
p; and q; for i > —2 by the initial setting 


p-1 p-2 10 
a 3.25 
is | fi 1 oe) 
and the recurrence 
Di = LiPi-1 + Pi-2 (3.26) 
Gi = LViGi-1 + G-2 (3.27) 


for all natural numbers 7. 


(a) Use induction to prove that 


Pk-1 Pk-2 k 
=(-1 3.28 
dk—-1 Yk—2 ( ) ( ) 

for all natural numbers k. 
(b) Consider the equation 
Pk 

U0, 1,---,;Lk-1,Lk|| = — 3.29 
[0.21.22] = 2 (3.25) 


for natural numbers k. Show that it holds for k = 0. If it holds for 
k =n, use (3.26) and (3.27) to deduce that 


TnPn—1 T Pn—-2 
Ln dn—1 T In—2 


Eocene = (3.30) 
Replace x, by tp, + 1/an41 on both sides of (3.30). Use (3.24) 
to rewrite the left-hand side, and simplify the right-hand side to 
obtain (3.29) with k = n+ 1. Conclude that (3.29) holds for all 
natural numbers & by induction. As a corollary, deduce that (3.30) 
holds for all natural numbers n. 
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(c) Use (3.29) to express [[1,2,2]] and [[1,2, 2, 2]] as rational numbers. 


3. Approximating irrationals. For a real number 2, the floor || is 
defined to be the largest integer | with 1 < x. The fractional part of x 
is x— |a|. 


(a) If x is an irrational number, show that the fractional part is an 
irrational number, with 0 < #— |2| <1. 


(b) If # is an irrational number, show that the reciprocal (x—|a|)~1 of 
the fractional part of x is an irrational number with 1 < (a—|a|)7. 


(c) Let s be an irrational number. Define a sequence of real numbers 
8; by So = s and 
Sit = (95 — |si])~* 
for i in N. Show that s, is irrational for all natural numbers k. 
(d) Show that 1 < |s;,]| for all positive integers k. 
(e) Show that s; = |s;| +1/s;41 for all natural numbers 7. 
(f) Show that 


s = [[[so], [s1J,---, Lsi-a], [si], sia] (3.31) 


for all natural numbers 7. 


(g) Setting x; = |s;| for all natural numbers 7, consider the numbers 
p; and q; defined by (3.25) through (3.27) above. Show that p; and 
dk are integers, with q, > k for all natural numbers k. 

(h) Use (3.31) (with ¢ = &), and (3.30) (with n = k+ 1, and 5,4, in 
place of x,41), to show that 


s= Sk4+1Pk 1 Pr-1 (3.32) 


Sk+19k 1 Wk-1 


(i) Use (3.32) to show that 


Dk —1 


Pk Pk-1 
Gk Uke (Sk419% + Ye-1) 


dk dk-1 


(j) Using (3.28), along with the inequalities s,,, > 1 from (b) and 
dx = k from (g) above, conclude that the irrational number s is 
approximated by the rational number p;/q, to within a tolerance 
given by 


for all positive integers k. 


66 


Introduction to Abstract Algebra 


(k) Use continued fractions to compute some rational approximants to 
e and a. For example, with s = so = 7 = 3.14159..., we have 
8, = (w — 3)7! = 7.06251..., so r9 = 3, 21 = 7, and 
22 
[[to, ei] = [3,7] =>. 


(1) Compare your approximants from (k) with those that are given by 
truncating the series 


for e and 


3.8 


Notes 


Section 3.5 


Many authors use the notation Zz to denote the set of integers modulo 2 
(and similar notation for other moduli). However, this notation gives no hint 
of the inherent quotient structure (Example 5.20, page 103). Furthermore, it 
clashes with the standard notation Zz for the set of dyadic integers (Study 
Project 3 in Chapter 5). 


Chapter 4 


GROUPS AND MONOIDS 


Chapter 2 showed how sets of functions could form semigroups, monoids, or 
groups. Many other sets have a similar structure, even though they do not 
consist of functions. In order to study the structure, the key properties of the 
sets of functions are abstracted and formulated in general terms. 


4.1 Semigroups 


If S is a semigroup of functions, then we may consider function composition 
as a map 
SxS—S;(9,flogof (4.1) 
whose domain is the set S x S of ordered pairs (g, f) of elements of S. The 
closure (2.9) under composition guarantees that S may serve as the codomain 
of the map (4.1). Recall that function composition is always associative. 
The abstract properties of semigroups of functions are then captured by the 
following definition. 


DEFINITION 4.1 (Semigroups.) Let S be a set equipped with a map 
SxS S3(a,y)rea-y (4.2) 


assigning an element x-y or xy of S to each ordered pair (x,y) of elements 


of S. 


(a) In general, the map (4.2) is known as a multiplication on S' or (more 
formally) as a binary operation on S. 


(b) The existence of such a map is described as the closure of the set S with 
respect to the multiplication. 


(c) The pair (S,-) consisting of the set S with its multiplication - is called 
a semigroup (or an abstract semigroup) if the associative law 


ay z)=(@-y)-2 (4.3) 
holds for all elements x, y, and z of the set S. 
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DEFINITION 4.2 (Commuting elements.) Two elements x and y of 
a semigroup (S,-) are said to commute if x-y = y- x. The semigroup (S,-) 
is said to be commutative ifx-y=y-a« for allx, yin S. 


Example 4.3 (The real interval (1, 0o).) 

Let S be the set or interval (1, co) of real numbers x with x > 1. Then S' forms 
a semigroup under the usual (associative and commutative) multiplication of 
real numbers. 


Example 4.4 (Irrationals.) 

Let S' be the set of irrational real numbers. Then S does not form a semi- 
group under the usual associative multiplication of real numbers, since V2 is 
a member of S (compare Theorem 3.9, page 55), but x-a = 2 is nota member 
of S. 


Although the binary operation (4.2) on a general semigroup S' is called a 
“multiplication,” it does not have to be an actual multiplication of numbers 
in the usual sense. Here is one example. For another, see Example 4.9 below. 


Example 4.5 (Positive integers.) 
Let S' be the set of positive integers. Define a “multiplication” on S by 


m:n = gced(m,n). (4.4) 
Then (S,-) forms a commutative semigroup. l] 


Function composition is always associative. But with general operations as 
in (4.2), you should be careful not to take associativity for granted, even when 
you are on familiar ground. 


Example 4.6 (Integers under subtraction.) 
Consider the set Z of integers. Then Z is closed under the operation of 
subtraction: 

ZxXZ—Z;(x,y)eau-y. 


However, Z does not form a semigroup under subtraction, since subtraction 
is not associative. Indeed, 


3-—(5-4)=3-1=2, 


while 
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A semigroup S' of functions always forms a semigroup (S,o), with function 
composition as the “multiplication,” in the sense of the abstract Definition 4.1. 
In general, this multiplication is not commutative (compare Example 2.3, page 
29). However, Exercise 43 in Chapter 2 shows that disjoint cycles commute. 

A semigroup S of functions may also provide the underlying set for an 
abstract semigroup structure with a multiplication which is different from the 
composition of functions. For example, the set P of power maps 


Pn: ROR area” 


for natural numbers n (compare Exercise 10 in Chapter 2) certainly forms a 
semigroup of functions. On the other hand, it also forms a semigroup under 
the usual “componentwise” function multiplication of calculus: 


Pm* Pn = Pm+n 


or 
m n m+n 


(Pm * Pn)(£) = Pm(X)- pala) =a" - a2" =x = Dmtn(2) 
for x in R. Compare this with 


Pm ° Pn = Pmn 


for the function composition. 


4.2 Monoids 


A monoid of functions on a set X is a semigroup of functions on X that 
contains the identity function id, on X. As far as function composition is 
concerned, the key property of the identity function is (2.12). This property 
is abstracted by (4.5) below. 


DEFINITION 4.7 (Abstract monoids.) Let (M,-) be a semigroup with 
- as multiplication. Then M is said to form a monoid (or an abstract monoid) 
(M,-,e) if it contains an element e satisfying 


€-L=xX=L2-e (4.5) 
for allx in M. The element e is known as the identity element of the monoid 


M. (Proposition 4.13 below shows that the identity element is unique.) 


Example 4.8 (The real interval (1,00).) 
The semigroup S = (1,00) of Example 4.3 does not form a monoid. Certainly 
S does not contain the usual identity element 1 for the multiplication of real 
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numbers. In fact, for each element e of S, we have e- ax > x for all x in S. 
Thus no element e of S can satisfy the identity property (4.5). l] 


Example 4.9 (Natural numbers.) 


The set N of natural numbers forms a commutative monoid under addition, 
with 0 as the identity element. On the other hand, the semigroup of positive 
integers under addition does not form a monoid (Exercise 4). 


Example 4.10 (Least common multiples.) 


Let S be the set of positive integers. Define a “multiplication” on S$ by 
m-n=l|em(m,n). (4.6) 


Then (S,-,1) forms a commutative monoid. l] 


Example 4.11 (Intersection.) 


Let n be a natural number. Let P(nm) denote the set of subsets of the set 
n of natural numbers less than n — compare (2.26). Then P(n) forms a 
commutative monoid under the “multiplication” M of intersection: 


XY = {z| z in both X and Y} 
with the full set n as the identity element. (See Exercise 6.) ll 


Example 4.12 (Matrices.) 


The set R3 of all 2 x 2 real matrices forms a monoid under the operation 
of matrix multiplication. The identity element is the 2 x 2 identity matrix 
(2.13). Note that this monoid is not commutative (Exercise 9). 


Definition 4.7 refers to the identity element of a monoid. The definite article 
is justified by the following result. 


PROPOSITION 4.13 (Uniqueness of the identity.) 


Let M be a monoid. Ife and f are identity elements of M, thene = f. Thus 
the identity element of a monoid is unique. 


PROOF We have e =e-f = f. The first equality holds since f is an 
identity element. The second equality holds since e is an identity element. 
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4.3. Groups 


Recall that functions are bijections if and only if they are invertible. Thus 
a monoid G of functions on a set X is a group of permutations of X if and 
only if each element f of G has an inverse element f~! in G, with 


fof-Vaidy =f ay 
This property is abstracted by (4.7) below. 
DEFINITION 4.14 (Abstract groups.) A monoid (G,-,e) is a group 
(or an abstract group) if each element x of G has an inverse x~! in G with 
i oe =e -7 * Ge (4.7) 


In other words, a group (G,-,e) is a set G with a multiplication - satisfying 
the following properties: 


Closure: x-y lies in G for all x, y in G; 
Associativity: x-(y-z)=(x-y)-z for all x, y, z in G; 


Identity: There is an element e in G withe-x =x =2-e for all x inG; 


Inverses: For each x in G, there is x~! inG withx-a-'|=e=a7}-a. 


Commutative groups are often described as abelian. 


Consider the four properties of Definition 4.14. Note that semigroups are 
required to satisfy the closure and associativity, while monoids are required 
to satisfy closure, associativity, and the identity property. 

The uniqueness of inverses of invertible functions (Proposition 2.23, page 
35) has its abstract counterpart in groups (see Exercise 11 for the proof): 


PROPOSITION 4.15 (Uniqueness of inverses.) 


In a group G, each element x has a unique inverse. 


We now consider some examples of groups. 


Example 4.16 (Real numbers under addition.) 

The real numbers form a group (R,+,0) with addition as the commutative 
“multiplication” operation. The “inverse” or additive inverse of a real number 
r is its negation —r. 
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In general, if the “multiplication” in a group G is denoted by an addition 
+ as in Example 4.16, the group G is described as additive. Its identity 
element is written as a zero, and the inversion is described as negation. These 
conventions are normally reserved for abelian groups. 


Example 4.17 (Muliplication of nonzero reals.) 


Under multiplication, the nonzero real numbers form a commutative group 
(R*, i) 1). 


Example 4.18 (The general linear group.) 

Let GL(2,R) be the set of invertible 2 x 2 matrices with real entries. Then 
GL(2,R) forms a nonabelian group under the usual matrix multiplication, 
having the identity matrix Iz as the identity element of the group, and with 
the usual inversion of matrices. The group GL(2, R) is called the (real) general 
linear group of dimension 2. 


Examples 4.17 and 4.18 are special cases of a general source of abstract 
groups: sets of invertible elements of monoids. 


DEFINITION 4.19 (Invertible elements.) Let (M,-,e) be a monoid. 
An element u of M is said to be invertible or a unit if there is an element v 
of M such thatu-v=e=v-u. 


PROPOSITION 4.20 (Invertible elements form a group.) 
Let (M,-,e) be a monoid. Then the set M* of invertible elements of M forms 
a group (M*,-,e). 


PROOF We verify the four conditions listed in Definition 4.14. 
Closure: Suppose that u; and uz are units of M, with u,v, = e = v,u, and 
UgU2 = € = Voug. Then 


(u1u2)(v2v1) = UUgvev1 = U1Lev] = U1U1 = € (4.8) 


and 


(v1V2)(Ugu1) = Vi VgUgUy = V{EU, = V{U1 =e, (4.9) 


SO U1 U2 is also a unit. 
Associativity: The associativity of the multiplication in M™* is a special case 
of the associativity of the multiplication in the monoid M. 
Identity: The identity element of M™* is the identity element e of M: The 
equations 

e-wu=uUu=Uu-e 


in M* are special cases of the equation (4.5) in M. 
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Inverses: By the definition of the set M*, each element u of M* has an 
inverse v. Note that v lies in M™*, since it has wu as its inverse. 


DEFINITION 4.21 (The group of units.) For a monoid (M,-,1), the 
group (M*,-,1) is known as the group of units of the monoid M. 


Example 4.22 (Integers under multiplication.) 
The integers form a commutative monoid (Z,-,1) under multiplication. The 
group of units of the monoid of integers is {+1}. 


Example 4.23 (Invertible real numbers.) 

The notation of Definition 4.21 is consistent with Example 4.17: the set of 
units of the monoid of real numbers under multiplication is the set R* of 
nonzero real numbers. 


Example 4.24 (The group of units of a group.) 

If (G,-,1) is a group, then it is certainly a monoid: Just forget the inversion. 
Now (G,-, 1) is its own group of units. In particular, Proposition 4.15 and the 
proof of Proposition 4.20 — specifically (4.8) and (4.9) — yield the important 
formula 
Muy (4.10) 


(uju2)~* = Uy 


for elements u,, ug of a group G. Note the generalization of (4.10) given in 
Exercise 16. 


4.4 Componentwise structure 


Starting with given examples of semigroups, monoids, or groups, there are 
methods to obtain new semigroups, monoids, or groups from the given ones. 
One such method is the direct product construction. Recall that for sets X 
and Y, the (external) direct product or product of X and Y is the set 


XxY={(x#,y)|ae in X, y in Y} (4.11) 


of ordered pairs (x,y) of elements « from X and y from Y (Figure 4.1). In 
this context, the sets X and Y are known as the (direct) factors of the direct 
product. The set X x Y is sometimes called the cartesian product of X and 
Y, since it follows René Descartes’ recipe for constructing the real plane as 
RxR. Recall that two ordered pairs (x, y) and (x’, y’) are equal if and only if 
x=x' andy=y’. We write X? for X x X, dsecribing it as the direct square 
of the set X. 
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FIGURE 4.1: The product X x Y. 


Now suppose that X is a semigroup under the multiplication ox, while Y is 
a semigroup under the multiplication oy. We may then define a multiplication 
on X x Y by 


(%1,Y1) Oxxy (2, y2) = (%1 Ox La, yi OY Y2) (4.12) 


for #1, v2 in X and yi, y2 in Y. The multiplication (4.12) is described as a 
componentwise multiplication, since it works individually on the respective x- 
and y-components of the ordered pairs. The following result is easily verified 
(Exercise 18). 


PROPOSITION 4.25 (Direct product semigroup.) 
Let (X,0ox) and (Y,oy) be semigroups. Then under the componentwise mul- 
tiplication (4.12), the direct product X x Y forms a semigroup. 


DEFINITION 4.26 The semigroup (X x Y,oxxy) of Proposition 4.25 is 
called the (external) direct product of the semigroups (X,ox) and (Y,oy). 


Example 4.27 (The real plane.) 
The set R of real numbers forms a semigroup under multiplication. Then the 
real plane R? forms a semigroup under componentwise multiplication. (] 


If the semigroups (X,0x) and (Y, oy) are monoids, with respective identity 
elements ex and ey, then the componentwise identity element is the element 
exxY = (Ex, ey) (4.13) 


of X x Y. The following result, readily verified, is similar to Proposition 4.25. 
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PROPOSITION 4.28 
Let (X,0x,ex) and (Y,oy,ey) be monoids. Then under the componentwise 
multiplication (4.12), the direct product X x Y forms a monoid 


(X x Yoxxy,exxy) 
with the componentwise identity element (4.13). 
DEFINITION 4.29 (The direct product of two monoids.) The 


monoid (X xY,oxxy,exxy) of Proposition 4.28 is called the (external) direct 
product of the two monoids (X,ox,ex) and (Y,oy,ey). 


The final step in our examination of componentwise structure considers 
groups. Suppose that (X,ox,ex) and (Y,oy,ey) are groups. Then for an 
element (a, y) of X x Y, define the componentwise inverse 


(,y) > = (ey) (4.14) 
as an element of X x Y. 
PROPOSITION 4.30 


Let (X,ox,ex) and (Y,oy,ey) be groups. Then the direct product X x Y 
forms a group 


(X x Ywoxxy,exxy) 


under the componentwise multiplication (4.12), the componentwise identity 
element (4.13), and with componentwise inverses (4.14). 


For the proofs of Propositions 4.28 and 4.30, see Exercise 19. 


DEFINITION 4.31 (The direct product of two groups.) The group 


(X x Y,oxxy,exxy) 
of Proposition 4.30 is called the (external) direct product of the two groups 


(X,ox,ex) and (Y,oy,ey). 


Example 4.32 (The real plane.) 
The set R of real numbers forms a group under addition. Then the real plane 
R? forms a group under componentwise addition: 


(11, y1) + (@2, yo) = (1 + 22, y1 + yo). 


Note that this is just the usual addition for 2-dimensional real vectors. (] 


76 Introduction to Abstract Algebra 


The following theorem provides a good illustration of how componentwise 
structure is used. 


THEOREM 4.33 (Groups of units of products.) 

Let (Mj,-,e1) and (Mo,-,e2) be monoids. Then the group of units (M, x M2)* 
of the product monoid M, x Mg is the product Mf x Mz of the groups of units 
My, Mz of the respective factors My, Mo. 


PROOF The sets (M, x M2)* and My} x M3 are both subsets of the product 
M, x Mz. To prove the equality of the two subsets, it will be shown that each 
contains the other. 

Let (ui, u2) be an element of Mj x M3. Thus there are elements v; of M1 
and v2 of M2 such that 


UjV, =e, = Vy, and = ugve = €o = VoUg. (4.15) 
Then in the product monoid (M, x M2,-,(e1, €2)), we have 
(ui, U2)(v1, v2) = (e1, €2) = (v1, v2) (ui, U2), (4.16) 


so that (u1, U2) lies in (M, x Mp2)*. 
Conversely, suppose that (ui, u2) lies in (IM, x M2)*: There is an element 
(v1, U2) of My, x Mz such that (4.16) holds. In particular, 


(u1v1, U2v2) = (€1, €2) = (vit, V2U2) . 
The equality of the corresponding first components in this equation gives 
UW1U1 = €1 = V1U1, 


so that u, lies in Mj. Examination of the second components shows that ue 
lies in M3. It follows that (ui, u2) is an element of Mf x M3. l] 


It is relatively straightforward to extend the product constructions to larger 
numbers of factors. For example, a product X x Y x Z of sets X, Y, and Z 
may be built recursively as X x (Y x Z), or directly as the set 


XxYxZ={(a,y,z)|e in X, y in Y, z in Z} 


of ordered triples. The product X x X x X is known as the (direct) cube X? 
of the set X. For instance, the direct cube R® of the additive group (R, +, 0) 
of real numbers, with componentwise structure, is the group of 3-dimensional 
vectors. 

Componentwise structure is not limited to n-tuples. For example, the set 
IR3 of 2 x 2 real matrices carries a componentwise additive group structure, 
with addition given by the usual addition 


bi dye Q11 412 by, + a1 bi2 + a42 
ke p| B | + d21 be2 + a22 ( ) 


GROUPS AND MONOIDS AE 


of matrices. The same set carries a componentwise monoid structure, with 
multiplication given by the componentwise multiplication 


by1 by2 m Q11 412 = 611411 by24a12 (4 18) 
bi bee a21 422 621421 b22A22 : 


of matrices, and with the 2 x 2 all ones matrix 


as the identity element. The matrix product (4.18) is called the Hadamard 
product. It is certainly different from the usual matrix multiplication (2.7). 
For an application, see (7.27), and also Study Project 5 in Chapter 5. 


4.5 Powers 


Another source of componentwise structure is found in sets of functions 
f:X — S from a certain domain X to a codomain S that carries algebraic 
structure. For example, in calculus the componentwise sum jf + g of two real- 
valued functions f : R— R and g: R — R is determined by the specification 


(f + 9)(x) = f(a) + g(x) 


for all x in R. Under this operation, the set R® of all real-valued functions 
forms an additive group, with the constant function zero as the zero (identity 
element), and with the inverse of a function f given by the negation —f, so 
that 


(—f)(@) = -f(@) 


for all real x. Here is the general definition. Verification of the claims embod- 
ied in the definition is deferred to Exercise 22. 


DEFINITION 4.34 (Power structures.) Let X and S be sets. Consider 
the set S* of all functions f : X — S from X to S. 

(a) If S carries a semigroup structure (S,-), then the X-th power (S,-)* or 
S* of the semigroup (S,-) is the set S* equipped with the componentwise 
multiplication f -g given by 


(f- g)(@) = f(x): g(@) 


for xz in xX. 
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(b) If S carries a monoid structure (S,-,e5), then the X-th power (9,-,e5)* 
or S* of the monoid (S,-,es) is the X-th power semigroup (S*,-), with 
the constant function BE: X — S;2%+ eg as the componentwise identity 
element. 


(c) If S carries a group structure (S,-,e5), then the X-th power (S,-,es)* 
or S* of the group (S,-,eg) is the X-th power monoid (S*,-, E), with 
the componentwise inverse of a function f : X > S given by f-'(x) = 
f(x)~+ for each x in X. 

If X is the n-element set n = {0,1,...,n—1} for a positive integer n, then 
the powers S™ are known as the n-th powers S”. 


Example 4.35 (Bit strings.) 
Let n be a positive integer. A bit string of length n is an element 


b = bn—1bn—2... b2b1bo9 


of (Z/2)” (with b; as the value b(t) of ¢ for 0 <i <n). For example, the bit 
string b might have been obtained as the binary (or base 2) expansion of the 
natural number 


Bn 12" | + by 22-7 H+ + bn2? + B12! + b92° 


(compare Exercise 19 in Chapter 1). According to Definition 4.34, the set 
(Z / 2)” of bit strings of length n inherits respective group structures under + 
and monoid structures under - from the addition and multiplication modulo 
2 on Z/> displayed in Figure 3.3. 


Example 4.36 (Vectors.) 
Let n be a positive integer. An n-vector or n-dimensional real vector is an 
element 


(0, £1, see (tn) 
of the power group R”. For example, in special relativity a 4-vector 
(ct, 1,02, r3) 


represents an event at time ¢ and spatial location (a,,x%2,x3) in a certain 
frame of reference, c being the speed of light. 


4.6 Submonoids and subgroups 


Componentwise structure on product sets, as was studied in the preceding 
sections, is one rich source of new semigroups, monoids, and groups. Another 
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is found from subsets that are closed under the given structure. Let (S,-) be 
a semigroup. Let X be a subset of S$ with the closure property 


x,y in X implies x-y in X. (4.19) 


Then X forms a semigroup under the multiplication - inherited from S. The 
closure property is given by definition, and the associativity of (X,-) is just a 
special case of the associativity given in the semigroup (S,-). 


DEFINITION 4.37 (Subsemigroups.) Let S be a semigroup, and let 
X be a subset of S. Then X is described as a subsemigroup of the semigroup 
S if it satisfies the closure property (4.19). 


Trivially, the empty set is a subsemigroup of every semigroup. 


Example 4.38 (Subsemigroups of the integers under addition.) 
The set of negative integers forms a subsemigroup of the semigroup (Z, +) of 
integers under addition. The set of odd integers does not form a subsemigroup, 
since the closure property is violated by examples such as 1 +3. 


DEFINITION 4.39 (Submonoids.) A subset X of a monoid (M,-,e) 
is said to be a submonoid if it is a subsemigroup of the semigroup (M,-), and 
if it contains the identity element e of M. 


If (X,-,e) is a submonoid of a monoid (M,-,e), then (X,-,e) is itself a 
monoid: The identity property (4.5) for X is just a special case of the identity 
property (4.5) for M. Trivially, the set {e} consisting only of the identity 
element is a submonoid of any monoid (M,-,e) with e as its identity element. 
Note that {e} is a subsemigroup by the identity property: e-e = e. 


Example 4.40 (Submonoids of the integers under addition.) 

The subsemigroup of negative integers does not form a submonoid of the 
monoid (Z,+,0) of integers under addition, since it does not contain the 
identity element 0 of Z. On the other hand, the monoid (N,+,0) of natural 
numbers under addition (compare Example 4.9) does form a submonoid of 
(Z,+,0). 


Example 4.41 (Stochastic matrices.) 
A 2x 2 real matrix 
A= ie A 


M1 42 
is said to be (row) stochastic if py, p2,q1,q2 are all nonnegative, 


pPitpe=1, and atqa=l. 
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Note that the identity matrix Ip is stochastic. Let II$ be the set of 2 x 2 
stochastic matrices. Then II3 forms a submonoid of the monoid R3 of all 2 x 2 
matrices under matrix multiplication. (Compare Exercise 24.) (] 


DEFINITION 4.42 (Subgroups.) A submonoid X of a group (G,-,e) 
is said to be a subgroup of G if it is closed under the inversion in G: 


x inx implies a' in X. (4.20) 


Note that the set {e} consisting only of the identity element is a subgroup 
of any group (G,-,e) with e as its identity element. Since a subgroup has to 
be a submonoid, with an identity element, it has to be nonempty. There is a 
quick way to check if a given nonempty subset X of a group G actually forms 
a subgroup of G. 


PROPOSITION 4.43 (The subgroup test.) 
Let X be a nonempty subset of a group (G,-,e). Then X is a subgroup of G 
if and only if it satisfies the closure property 


x,y in X implies zy in X. (4.21) 


PROOF First, suppose that X is a subgroup of G, and that x and y are 
elements of X. Then by the closure (4.20) under inversion, y~! lies in X. 
Since x and y~! lie in X, the closure property (4.19) then guarantees that 
x-y' lies in X. 

Conversely, suppose that the nonempty subset X of the group G satisfies 
the closure property (4.21). Since X is nonempty, it contains an element a. 
Then the closure property (4.21) shows that the identity element e = a-a~ 
lies in X. Again, for each element x of X, the closure property (4.21) shows 
that the inverse x~! = e-x~! lies in X. Finally, for x and y in X, the closure 
property (4.21) shows that the product x+y = x- (y~')~! lies in X, so that 
X forms a subsemigroup of (G,-). l] 


REMARK 4.44 In an additive group (G,+, 0), the closure property (4.21) 
reduces to closure under the subtraction 


v—y=2+(-y) 


in G. If the operation of a group G is written as multiplication, it is sometimes 
convenient to define x/y = x-y~!, an operation known as right division. l] 


Example 4.45 (Orthogonal matrices.) 
A 2x 2 (or larger square) matrix A is said to be orthogonal if the products 
AA? and A” A of A with its transposed matrix A’ are the identity matrix J. 
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In particular, the inverse of an orthogonal matrix A is its transpose A’, and 
the identity matrix I is orthogonal. Then by Proposition 4.43, the nonempty 
set O2(R) of orthogonal 2 x 2 real matrices forms a subgroup of the gen- 
eral linear group GL(2,R) (compare Example 4.18). Indeed, if A and B are 
orthogonal, the computations 


(ABT)(ABT)? = ABT(BT)? AT = ABTBAT = AAT =1 


and 
(ABT)? (AB?) = (BT)? AT ABT = BBT =I 


show that AB? = AB™! is orthogonal. The group O2(R) is called the (real) 
orthogonal group of dimension 2. 


We conclude this section with a classification of the subgroups of the group 
of integers under addition. 


THEOREM 4.46 (Subgroups of the integers.) 

Let J be a subgroup of the group (Z,+,0) of integers under addition. Then 
there is a natural number d such that J consists of the set dZ of integral 
multiples of d. 


PROOF Since J is a subgroup, it contains the identity element 0. If 
J = {0}, then J = 0Z, the set of multiples of 0. 

Otherwise, J contains a nonzero integer n. In this case it contains a positive 
integer (either n or —n). Consider the nonempty set S of positive elements 
of J. By the Well-Ordering Principle, the set S has a least element d. Then 
each integer multiple nd of d lies in J. Indeed if n is positive, nd is the sum 


n 


oD 
d+d+-.-+d 


of n copies of d, which lies in J since J is closed under addition. If n is 
negative, then nd = |n|(—d) is the sum 


of |n| copies of —d, which lies in J since J is closed under negation and 
addition. Finally, 0d = 0 lies in J. 

Now suppose that J contains an element a which is not a multiple of d. 
Apply the Division Algorithm to express a as a = dq+rwithO<r<d. 
Then 0 < r = a—dq = a+(—dg) lies in J by Proposition 4.43. This contradicts 
the choice of d as the smallest positive element of J. 
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4.7 Cosets 


A semigroup (G,-) carries an associative multiplication of its elements. It 
is very useful to extend this multiplication to subsets of G. Let X be a subset 
of a semigroup (G,-). If g is an element of G, define 


Xg={xg|«x in X} and gX ={gx|a in X}. (4.22) 


The sets of (4.22) are known respectively as the right and left cosets of the 
subset X with the element g. For example, the subgroup dZ of the group 
(Z, +,0) in Theorem 4.46 is the coset of din the semigroup (Z,-). The notation 
(4.22) is extended by setting XY or 


X:-Y={a-y|x in X,y in Y} (4.23) 


for subsets X and Y of a semigroup (G,-). In particular, Xg = X - {g} and 
{g}-X = gX for an element g of G. 

If X is a subset of a monoid G with identity element e, then the cosets eX 
and Xe both coincide with the subset X. There are further relations between 
the various cosets in a group (G,-). 


PROPOSITION 4.47 (Group cosets are isomorphic as sets.) 
Let X be a subset of a group G. Then for elements gi, go of G, the cosets 
Xgi, Xge2, and gi:X are all isomorphic as sets. 


PROOF The maps 
XOX CH wg 


and 
Xgi > XsyHs ygy* 


are mutually inverse bijections, so X = Xg:. Isomorphism is an equivalence 
relation (compare Exercise 1 in Chapter 2). It follows that Xg,; and X gp are 
isomorphic. Similarly, the maps 


X > Qx;cr gir 


and 

nX + Xsyrgrty 
are mutually inverse bijections, so X = g,;X. The rest of the proposition 
follows from the fact that isomorphism is an equivalence relation. 


Since two finite sets are isomorphic if and only if they have the same number 
of elements (compare Example 2.26, page 36), we obtain the following. 
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COROLLARY 4.48 (Finite cosets are all the same size.) 
Let X be a finite subset of a group G. Then for elements gi, go of G, the 
cosets Xgi, Xg2, and gi X all have the same number of elements. 


Cosets of subgroups are equivalence classes. 


PROPOSITION 4.49 
Let H be a subgroup of a group G. 


(a) Define a relation R on G by 
gi R go if and only if hgi = g2 for some h in H. 
Then R is an equivalence relation on G. 
(b) The equivalence classes for R are the right cosets Hg. 
PROOF (a): Reflexivity: For g in G, we have eg = g with e in H. 
Symmetry: Suppose g,; R go, say hg; = go with h in H. Then g; = h~'go, 
so go R qn. 


Transitivity: Suppose g,; R gz and go R g3, say hg, = gz and h’ go = g3 with 
h, hb! in H. Then h’'hg; = h’g2 = g3, so gi R gs. 


(b) is immediate. l 
In the proof of Proposition 4.49(a), it is worth noting the parallel between 


the three properties required for the equivalence relation R and the three 
closure properties of the subgroup H: 


Equivalence relation is... | Subgroup ... 
... reflexive ... contains the identity 
... Symmetric ...is closed under inverses 
... transitive ...is closed under multiplication 


The Klein 4-group V, is a subgroup of the 24-element group $4 (compare 
Exercise 39 in Chapter 2). In general, the number of elements of a subgroup 
of a finite group is always a divisor of the total number of elements in the 
group. 


THEOREM 4.50 (Lagrange’s Theorem.) 
Let H be a subgroup of a finite group G. Then the number |H| of elements 
of H divides the number |G| of elements of G. 
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PROOF By Propositions 4.49 and 3.5, two distinct right cosets of H are 
disjoint. Suppose that there are j right cosets altogether. By Corollary 4.48, 
each right coset has |H| elements. Then 


|G| = 3|H], (4.24) 
so |H| divides |G]. ( 


The number j = |G|/|H| in (4.24) is called the index of H in the group 
G. More generally, if G is an infinite group with a subgroup H, the index 
of H is the (possibly infinite) number of right cosets of H in G. (Compare 
Exercise 38.) 

Lagrange’s Theorem is useful for limiting the possible subgroups of a given 
finite group. For example, it shows that in the 24-element group $4, a sub- 
group cannot be formed from the 9-element set consisting of the identity and 
the 8 permutations (2.33). In other words, without any calculation required, 
it shows that this 9-element set cannot be closed under multiplication. 

In any group G with identity element e, the subgroup G is described as 
improper, while the smallest subgroup {e} is described as trivial. A sub- 
group H is proper if it is not improper. Since prime numbers are irreducible 
(Proposition 1.11), Lagrange’s Theorem yields the following result. 


PROPOSITION 4.51 (Groups of prime order.) 
A group with a prime number of elements can have no proper, nontrivial 
subgroups. 


4.8 Multiplication tables 


There are various ways to compute the product «-y of two elements x and 
y ina group G. If Gis a group of matrices, we may use matrix multiplication. 
If G is a group of permutations, we may use function composition. A general 
method uses a table, the multiplication table of the group. 

Consider the Klein 4-group V4 of Example 2.31 (page 38). Writing the 
elements as (0) = e, (0 1)(2 3) = a, (0 2)(1 3) = 8, and (0 3)(1 2) =c, the 
multiplication table appears as displayed in Figure 4.2. The table consists 
of four parts, separated by the lines. The top left-hand corner may contain 
the name of the group or its multiplication. The top right part consists of 
the column labels. The bottom left-hand part consists of the row labels. The 
bottom right-hand part is called the body. Note that in Figure 4.2, the group 
elements are presented in the same order as column and row labels, with the 
identity element appearing first. This is not necessary for a table to perform 
its function of specifying the group products, but it is a convention that is 


GROUPS AND MONOIDS 85 


V4 e a 6b ec 
€ e€ a b Cc 
a a e€ cb 
b b c e a 
c c b a e 


FIGURE 4.2: Multiplication table of V4. 


usually followed. Putting the identity element first as a column and row label 
means that the first row and the first column of the body just repeat the 
respective column and row labels in order. For this reason, the body alone 
may be used to specify a group, the column labels being taken from the first 
row of the body, and the row labels from the first column of the body. 

The body of the table in Figure 4.2 has a particular feature: each row and 
each column of the body contains each element of the group exactly once. 


DEFINITION 4.52 (Latin squares.) For a natural number n, let Q be 
a set with n elements. Then annxn square containing each of the n elements 
of Q exactly once in each row and each column is called a Latin square (on 
the set Q). 


Theorem 4.55 below shows that the Latin square property of the body of 
the multiplication table of the Klein 4-group is actually typical of all finite 
groups. The theorem is preceded by a pair of results holding in general (not 
necessarily finite) groups. 


PROPOSITION 4.53 (Cancellation in groups.) 
Let G be a group, with elements x, y1, y2.- 


(a) If 
LeYy =X Ye, (4.25) 
then y, = yo. 
(b) If 
Yi- t= Yaz, 
then y, = yo. 


PROOF (a): Multiplying both sides of the equation (4.25) on the left by 
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x1, we obtain 


Y=Hey He ay Ha '- ey =e-yo=y- 


(b) is proved similarly (Exercise 41). (] 


COROLLARY 4.54 (Existence and uniqueness of solutions.) 
Consider the equation 
Uiy=z (4.26) 


in a group (G,-). If the equation (4.26) holds, knowledge of any two of the 
elements x, y, z specifies the third uniquely. 


PROOF If x and y are given, then z is specified uniquely by the multi- 
plication in G. If x and z are given, then a solution y to the equation (4.26) 
exists, namely y = x~!- z. Indeed: 


epee +2) = (gee) goes Se, 


The solution is unique, by Proposition 4.53(a). The existence and uniqueness 
of a solution x to (4.26) given y and z follow similarly (Exercise 42). 


THEOREM 4.55 (Group tables are Latin squares.) 
Let G be a finite group. Then the body of the multiplication table of G forms 
a Latin square on the set G. 


PROOF Consider the row of the table labeled by an element x of G. Let 
z be an element of G. Now z appears in the row labeled x, namely in the 
column labeled y, if and only if the equation (4.26) holds. By Corollary 4.54, 
a solution y to this equation exists, so z indeed appears in the row labeled 
xz. Moreover, the solution y is unique, so in the row labeled x, the element z 
only appears in the column labeled y, and not in any other column. 

A similar argument shows that each element appears exactly once in each 
column of the body of the multiplication table (Exercise 43). (] 


Note that in Figure 3.3 (page 59), the body of the group table on the left 
is a Latin square, while the body of the monoid table on the right is not. 


4.9 


1. 


10. 
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Exercises 


Let S be the union (—oo, —1) U (1,00) of the real intervals (—oo, —1) 
and (1,00). In other words, S is the set of real numbers x with || > 1. 
Show that S forms a semigroup under the usual multiplication of real 
numbers. 


. Verify the claims of Example 4.5. 


. Pick three integers J, m, and n at random (for example nonzero digits 


from the number of your telephone). See if there is a difference between 
l—(m—n) and (l—m) —n. 


(a) Show that the set of positive integers forms a semigroup under 
addition. 


(b) Show that for each positive integer e, the inequality e+ a > x holds 
for all positive integers x. 


(c) Conclude that the set of positive integers does not form a monoid 
under addition. 


. Verify the claims of Example 4.10. 
. Verify the claims of Example 4.11. 


. Show that the set P(m) (compare Example 4.11) forms a monoid under 


the operation U of set union: 
XUY={a|a in X oraz in Y}. 


What is the identity element of this monoid? 


. Let Pan(N) denote the set of finite subsets of N. 


(a) Show that Pan(N) forms a monoid under set union. 
(b) Show that Pgn(N) forms a semigroup under set intersection. 


(c) Show that Pan(N) does not form a monoid under set intersection. 


. Exhibit two real 2 x 2 matrices X and Y such that XY A YX. 


Consider the set L(2,R) of linear functions from R$ to itself. (Compare 
Exercise 2.9 in Chapter 2.) Show that L(2,R) forms a monoid under 
the addition defined componentwise by 


(La + Lp)(x) = La(x) + La(x) 


for x in R} and A, B in R32. 
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11. Prove Proposition 4.15: Rewrite the proof of Proposition 2.23 in abstract 
terms. 


12. Show that (Coe = x for each element of a group G. 


13. Let X be a set. What is the group of units of the monoid X* of 
functions? 


14. What is the group of units of the monoid (N,+,0) of Example 4.9? 


15. Determine the group of units of the monoid A of affine functions (see 
Exercise 13 in Chapter 2). 


16. Let uy, u2,..., Ur—1, Ur be elements of a group G. Show that 


1 1, 1 p= 
(W1Ug...Up—1Up) = UP Uy Un UY 


17. Let a, b, and c be elements of a group (G,-,e), with abc = e. Give a 
careful proof that cab = e. 


18. Prove Proposition 4.25: 


(a) Show that X x Y is closed under componentwise multiplication. 


(b) Show that the componentwise multiplication is associative. 
19. Prove Propositions 4.28 and 4.30: 
(a) Show that 
(ex, ey) oxxy (#,y) = (@,y) = (@,y) oxxy (ex, ey) 
for each element (x,y) of X x Y. 
(b) Show that 
(2~*,y~") oxxy (@,y) = (ex, ey) = (@,y) oxxy (277, y7") 
for each element (x,y) of X x Y. 


20. In the group R x R of Example 4.32, describe the identity element and 
inverses. 


21. Consider the set R3 of real 2 x 2 matrices. 


(a) Determine inverses, and the identity element, in the group given 
by matrix addition. 


(b) Determine the group of units within the monoid of matrices under 
Hadamard multiplication. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 
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In the respective parts (a), (b), and (c) of Definition 4.34, verify that 
S* forms a semigroup, monoid, and group under the componentwise 
multiplication. 


Let X be a set, and let S be a commutative semigroup. Show that the 
power S* is commutative. 


Show that the set II} of all 2 x 2 row-stochastic matrices (as defined in 
Example 4.41) forms a submonoid of the monoid R}3 of all real 2 x 2 
matrices under matrix multiplication. 


A square matrix is said to be column-stochastic if its transpose is row- 
stochastic. Show that the set of 2 x 2 column-stochastic matrices forms 
a monoid under matrix multiplication. 


A square matrix is said to be doubly stochastic if it is both row- and 
column-stochastic. 


(a) Give an example of a nonidentity 2 x 2 doubly stochastic matrix. 


(b) Show that the set of all 2 x 2 doubly stochastic matrices forms a 
monoid under matrix multiplication. 


(c) Determine the group of units within the monoid of 2 x 2 doubly 
stochastic matrices. 
ab 
cd 


is said to be upper triangular if c = 0. 


A 2x 2 matrix 


(a) Show that the set of upper triangular real 2 x 2 matrices forms a 
monoid N under matrix multiplication. 


(b) Determine the group of units N* of the monoid N. 


kk 
K= { f i | k real, nonzero} 


of all “constant” nonzero 2 x 2 real matrices. 


Consider the set 


(a) Show that K forms a subsemigroup of the semigroup of all 2 x 2 
real matrices under matrix multiplication. 


(b) Show that K does not form a submonoid of the monoid of all 2 x 2 
real matrices under matrix multiplication. 


(c) Show that K forms a group under matrix multiplication. 


90 


29 


30. 


3l. 


32. 


33. 


34. 


35. 


36. 


37. 
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Let H and K be subgroups of a group G. Show that the intersection 
HOOK of H and K (the set of elements of G common to both H and 
K) is also a subgroup of G. 


Let G be a group. For each element 7 of an “index set” J, let H; be a 
subgroup of G. Show that the intersection 


() H,={g|g in H; forall i in I} 
tint 
of the subgroups H; is also a subgroup of G. 


Let C(R) be the set of all continuous real-valued functions f : R > R. 
Show that C(R) forms a subgroup of the group R® of all real-valued 
functions with componentwise structure. 


Let r be a positive integer. Show that the set C’(R) of all real-valued 
functions f : R > R with a continuous r-th derivative f‘") forms a sub- 
group of the group R® of all real-valued functions with componentwise 
structure. 


For a real number @ (an angle in radians), show that the matrix 


cos @ — sin@ 
sin@ cos@ 


is orthogonal. 


For real numbers @ and y, show that the addition formulas for the cosine 
and sine are given by equating respective components on both sides of 
the product equation 


cos@ —sin@| [cosy —siny| _ |cos(@ + y) —sin(@ + ¢) 
sin@ cosé sing cosy | |sin(@+y) cos(6+ 9) 


in the orthogonal group. 


(a) Show that each subgroup of the group of integers under addition is 
a subsemigroup of the semigroup of integers under multiplication. 


(b) Exhibit an example of a subgroup of the group of real numbers 
under addition which is not a subsemigroup of the semigroup of 
real numbers under multiplication. 


Let H be a subgroup of a group G. Show that two right cosets of H are 
either equal or disjoint. 


Let H be a subgroup of a finite group G. Show that the number of left 
cosets of H in G is equal to the number of right cosets of H in G. 


38. 


39. 


40. 


Al. 
42. 


43. 
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Show that the group of even integers has index 2 in the group (Z, +, 0) 
of integers under addition. 


Let H and K be subgroups of a group G. Define a relation R on G by 
gi R go if and only if hgjyk = go for some h in H and k in kK. Show 
that R is an equivalence relation on G. (The equivalence classes Hg Kk 
are known as double cosets.) 


Let H be a subgroup of a group G. Let x and y be elements of G. Show 
that the cosets Hx and Hy are equal if and only if zy~! is an element 
of H. 


Prove Proposition 4.53(b). 


Let y and z be given elements of a group G. Show that there is a unique 
solution x in G to the equation x-y = z. 


Complete the proof of Theorem 4.55 by showing that each group element 
appears exactly once in each column of the body of the multiplication 
table. 


4.10 Study projects 


1. 


Nim sums. The game of Nim (see Figure 4.3) is played with a small 
collection of counters. The counters are arranged into several groups or 
heaps. There are two players, who take turns to play. When it is your 


FIGURE 4.3: <A Nim position with three heaps. 


turn, you are allowed to remove counters from only one heap. You must 
remove at least one counter. The player removing the last counter is 
the winner. 
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How can you win at Nim? The game is analyzed using bit strings and 
the power group (Z/2,+)" introduced in Example 4.35. Consider the 
position displayed in Figure 4.3. In the first heap, there are 5 counters, 
101 or 0101 in binary. In the other two heaps, there are 8 = 1000 and 
6 = 110 or 0110 counters respectively. Zeros are added as needed to 
make the lengths of all the strings equal. 


Now the Nim sum b+. c of two natural numbers b and c is defined 
by identifying natural numbers with their binary representations, and 
taking the sum of these bit strings in (Z/2,+)” (for any positive integer 
n with 2” > max(b,c)). 


(a) Show that the Nim sum b +2 c of two natural numbers b and c 
is independent of the number of zeros appended in front of their 
binary representations. 


(b) Show that the set of natural numbers forms a group (N,+2,0) 
under Nim sum. 


(c) Compute the Nim sum 5 +2 8 +2 6. 


(d) Show that you will not lose in Nim if after each move, you leave 
the Nim sum of the sizes of the heaps at 0. 


(e) Show that the unique winning move from the position in Figure 4.3 
is to remove 5 counters from Heap 2. 


2. The orthogonal group. Consider a 2 x 2 orthogonal matrix 
ab 
a=[e9). 


(a) Show that the orthogonality condition A7A = I reduces to the 
three equations 


a+ce=1 (4.27) 
P+dé=1 (4.28) 
ab+cd=0. (4.29) 


(b) Show that the solutions (a,b,c,d) to the simultaneous equations 
(4.27) through (4.29) are of two types, (cos 6, — sin 6, sin 6, cos 6) 
as in the picture on the left, or (cos 6,sin 6, sin @,—cos 6) as in the 
picture on the right: 
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(c) Show that the set 


s0u(R) = | [OTs ceo | Jo<e<znl 


sin@ cos@ 
forms a subgroup of the orthogonal group. (The group SO2(R) is 
called the special orthogonal group of dimension 2.) 


(d) Show that the orthogonal group is the union of SO2(R) and the 
coset 


1 0 
SO2(R) Ki E| ; 
3. Dihedral groups. Consider an integer n > 2. 


(a) Show that the set 
CG, = { fees 2 | oe nl 


sin(2rm/n) cos(2rm/n) 


forms a commutative subgroup of the special orthogonal group. 
(b) Show that the union 


1 0 
Dea cy ucn 9 


forms a noncommutative subgroup of the orthogonal group. (The 
group D,, is called the dihedral group of degree n.) 


(c) Consider the set 


ree { [Satz] [ose 
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of column vectors from R}. Using a column vector | to specify 
2 

the point (71,22) of the plane R?, show that the set P,, specifies 

the points of a regular n-gon in the plane. 


ex ae 


Ps 


cuca 


ace 


(d) For a matrix A in the dihedral group D,,, and for a column vector 


p in P,, (a vertex of the n-gon), show that the matrix product 


La(p) = Ap 


is again a vertex of the n-gon. Conclude that the dihedral group 
D, forms the full group of symmetries of the regular n-gon P,. 


4.11 Notes 
Section 4.3 


N.H. Abel was a Norwegian mathematician who lived from 1802 to 1829. 


Section 4.4 

R. Descartes was a French mathematician and philosopher who lived from 
1596 to 1650. J.S. Hadamard was a French mathematician who lived from 
1865 to 1963. 
Section 4.7 


J.L. Lagrange was a French mathematician who lived from 1736 to 1813. 


Chapter 5 


HOMOMORPHISMS 


A study of sets inevitably leads to a study of functions between sets. Similarly, 
a study of algebraic structures such as semigroups, monoids, or groups entails 
a study of the functions that preserve the algebraic structure. These functions 
are known as homomorphisms (literally “same shape”). 


5.1 Homomorphisms 


Consider the exponential function exp : R — R;z + e*. By the law of 
exponents, 
*+¥ — e* . eY = exp(z) - exp(y). (5.1) 


Now the domain of the exponential function is the semigroup (R,+) of real 
numbers under addition. The codomain of the exponential function is the 
semigroup (R,-) of real numbers under multiplication. The equation (5.1) 
says that we may either add two real numbers x and y in the domain, and 
then map across to exp(a + y) in the codomain, or else map x and y across 
individually to exp(a), exp(y) in the codomain, and then multiply these two 
numbers in the codomain. Either way, we get the same answer. 


exp(a + y) =e 


DEFINITION 5.1 (Homomorphisms and isomorphisms of semi- 
groups, monoids, and groups.) 


(a) Let 6 : (X,0) > (Y,x*) be a function from a semigroup (X,0°) to a 
semigroup (Y,*). Then @ is said to be a semigroup homomorphism if 
O(x1 0&2) = O(x1) * O(a2) 
for all x1, x2 inX. 


(b) Let 0: (X,0,e) > (Y,x, f) be a function from a monoid (X,0,e) to a 
monoid (Y,*, f). Then @ is said to be a monoid homomorphism if it is 
a semigroup homomorphism 6 : (X,0,) — (Y,*) with 6(e) = f. 


(c) Let 6: (X,0,e) > (Y,*,f) be a function from a group (X,°,e) to a 
group (Y,*,f). Then @ is said to be a group homomorphism if tt is a 
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monoid homomorphism 0 : (X,0,e) > (Y,*, f) with 0(x—') = (0(x))~+ 
for all x in X. 


(d) Bijective semigroup, monoid, and group homomorphisms are described 
respectively as semigroup, monoid, and group isomorphisms. 


The relationship of isomorphism between semigroups, monoids, or groups 
X and Y is often denoted by 
xXx2Y 


— compare (2.25). The context should make clear what kind of isomorphism 
is presented: of sets, semigroups, monoids, or groups. 


Example 5.2 (The exponential function.) 

The law of exponents (5.1) shows that exp : (R,+) — (R,-) is a semigroup 
homomorphism from the semigroup of real numbers under addition to the 
semigroup of real numbers under multiplication. Furthermore, the equation 


exp(0) = 1 


shows that exp : (R,+,0) — (R,-,1) is a monoid homomorphism from the 
monoid of real numbers under addition to the monoid of real numbers under 
multiplication. l] 


Example 5.3 (Inclusion of a subgroup.) 
Let H be a subgroup of a group G. Then the inclusion function 


J: HoG heh 


is a group homomorphism. l] 


Example 5.4 (Projections.) 
Given sets X and Y, define the respective projections 


miXxXYoX;(t,y)r 2 
and 
TM: X XY oY;(a,y)ry 


to the first and second factors. If X and Y are semigroups, monoids, or groups, 
then the projections are homomorphisms of semigroups, monoids, and groups 
respectively. 


PROPOSITION 5.5 (Semigroup homomorphisms between groups.) 
Let 0: (X,0) > (Y,*) be a semigroup homomorphism between two groups 
(X,0,e) and (Y,x*, f). Then @ is a group homomorphism. 
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PROOF § Since @ is a semigroup homomorphism, the equation 
O(e) * 0(e) = O(eo e) = O(e) 


holds in Y. However, since f is the identity element of Y, and 6@(e) is an 
element of Y, the identity property (4.5) in Y gives 


O(e) * f = Oe). 
It follows that 0(e) x 6(e) = O(e) * f, so A(e) = f by Corollary 4.54, and @ isa 


monoid homomorphism. 
Now for each element x of X, we have 


O(x) * O(a") = O(ao a") = O(e) = f. 


But 6(x) * (0(x))~' = f, so Corollary 4.54 again gives @(a~1) = (@(x))7}, 
making # a group homomorphism. l] 


In contrast with Proposition 5.5, a semigroup homomorphism between 
monoids need not be a monoid homomorphism (compare Exercise 1). 


THEOREM 5.6 (Monoid homomorphisms and groups of units.) 
Let 6: (M,0,e) — (N,x, f) be a monoid homomorphism. Then @ restricts to 
a group homomorphism 6* : M* — N* between the corresponding groups of 
units. 


PROOF Suppose that u lies in M*, with wov =e = vou for some v in 
M. Then 
O(u) * A(v) = O(uo v) = O(e) = f = A(v) * A(u), 


so that 6(u) lies in N*. The restriction 
0* : M* > N*;ur% O(u) 


is a semigroup homomorphism between the respective groups of units. By 
Proposition 5.5, it is then a group homomorphism. 


Example 5.7 (Determinants.) 
The determinant function 


det : R3 — R; i | + ad — be (5.2) 


is a monoid homomorphism from the monoid of 2 x 2 real matrices under 
multiplication to the monoid of real numbers under multiplication (compare 
Exercise 7). It restricts to a group homomorphism from GL(2, R) to the group 
of nonzero real numbers under multiplication. l] 
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A function f : X — Y between sets is fully described by its graph, the 
subset 


{(a, f(#)) |x in X} (5.3) 
of X x Y. Homomorphisms may then be recognized by their graphs. 


PROPOSITION 5.8 (The graph of a homomorphism.) 

Let (X,0) and (Y,*) be semigroups. Then a function f : X — Y is a 
semigroup homomorphism if and only if the graph (5.3) is a subsemigroup of 
the direct product semigroup X x Y. 


PROOF If f is a semigroup homomorphism, and 21, x2 are elements of 
X, then 


(x1, f(@1)) (x2, f(@2)) = (1 © @2, f(x1) * f(w2)) = (@1 0 £2, f(x1 0 @2)), 


so the graph is closed under multiplication. Conversely, suppose the graph 
is closed under multiplication. Then for elements 2; and x2 of X, the graph 
contains both (21 0 vo, f(@1 0 r2)) and 


(x1, f(@1)) (2, f(@2)) = (@1 © 2, f(x1) * f(x2)). 


However, since f is a function, there is a unique element (2, f(x)) of the graph 
for each element x of X. By this uniqueness for the element x, 0 x2 of X, we 
have f(x, 02) = f(21) * f(a), so that f is a semigroup homomorphism. [] 


COROLLARY 5.9 

Let (X,0,e) and (Y,*, f) be monoids. Then a function f : X > Y is a 
monoid homomorphism if and only if the graph (5.3) is a submonoid of the 
direct product monoid X x Y. 


5.2 Normal subgroups 


Let f : X — Y bea function. The image f(X) = {f(z) | a in X}isa 
subset of the codomain Y. If f is a homomorphism of semigroups, monoids, 
or groups, the image will carry the corresponding algebra structure. 


PROPOSITION 5.10 (Images of homomorphisms. ) 
Let f : (X,-) > (Y,-) be a semigroup homomorphism. 


(a) The image f(X) is a subsemigroup of Y. 
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(b) If f : (X,-,ex) > (Y,-,eyv) ts a monoid homomorphism, then f(X) is 
a submonoid of Y. 
(c) If f : (X,-,ex) — (Y,-,ey) ts @ group homomorphism, then f(X) is a 
subgroup of Y. 
PROOF (a): For elements x and w’ of X, we have f(x)f(2’) = f(az’), 
showing that f(X) is closed under multiplication. 
(b): In this case ey = f(ex) lies in f(X). 


(c): For an element x of X, we have f()~! = f(a~'), so that f(X) is closed 
under inversion. (] 


Now consider a group homomorphism f : X — Y from a group (X,-,ex) 
to a group (Y,:,ey). Asa function f : X — Y from the domain set X to the 
codomain set Y, the homomorphism f : X — Y specifies a kernel relation 
ker f on X, with 


x ker f 2’ if and only if f@) =f) 


(3.3). The equivalence class [ex]ker ¢ of the identity element ex of X is the 
inverse image 


f-"{Flex)} 


— compare (3.5) and Exercise 3 of Chapter 3. Since f : X — Y is a group 
homomorphism, this equivalence class may be expressed in the form 


[ex]ker f= f~'{ey} (5.4) 
as the inverse image of the identity element ey of the codomain group Y. 
PROPOSITION 5.11 (Kernel class of the identity.) 
Let f : (X,-,ex) > (Y,-,ey) be a group homomorphism. 
(a) The equivalence class (5.4) forms a subgroup N of X. 
(b) For alla in X andn in N, 
ana’ liesin N. (5.5) 


PROOF (a): First note that N is nonempty, since it contains the element 
ex. Then for elements n and n’ of N, the homomorphic properties of f give 


f(n'n™) = f(n') f(n") = f(n')f(n)™ = evey' = ey, 


so that N is a subgroup of X by Proposition 4.43. 
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(b): The homomorphic properties of f give 
flana-) = f(a) f(n) fe) = flaey fla)! = fla) f(a)" = ey, 


so that xnx—! lies in N as required. ] 


DEFINITION 5.12 (Normal subgroups, group kernels.) Let X be 
a group. 


(a) A subgroup N of X satisfying the additional closure property (5.5) is 
called a normal subgroup of X. 


(b) For a group homomorphism f : X > Y with domain X, the normal 
subgroup f— {ey} of X is called the (group) kernel Ker f of f. 


Note the distinction between the kernel relation ker f (lower case “k”) and 
the kernel subgroup Ker f (upper case “K”). 


Example 5.13 (The Klein 4-group.) 

The Klein 4-group V4 is a normal subgroup of the symmetric group S4. 
Indeed, the nonidentity elements of V4 consist of all 3 possible products ao a’ 
with disjoint 2-cycles a and a’. Then for any permutation @ in S4, we have 


Bo(aca’)oB' =(BoacfB')o(Boa' of). (5.6) 


By Exercise 38 in Chapter 2, the two factors on the right-hand side of (5.6) 
are again disjoint 2-cycles. Thus V4 satisfies the additional closure property 
(5.5). On the other hand, {(0), (0 1)} is a subgroup of $4 which is not normal 
(compare Exercise 14). J 


The easy proof of the following result is left as Exercise 15. 


PROPOSITION 5.14 (Normal subgroups of abelian groups.) 


In an abelian group G, every subgroup is normal. 


Consider a group homomorphism f : (G,-,ex) — (Y,-,ey). According to 
Definition 5.12(b), the equivalence class [ex ]ker ¢ of the identity element ex 
of X under the kernel relation ker f is the group kernel Ker f. More generally, 
each equivalence class under the kernel relation ker f is a coset of the group 
kernel Ker f. 


PROPOSITION 5.15 (Kernel classes are cosets.) 

Let f : X — Y be a group homomorphism, with kernel relation ker f and 
group kernel N = Ker f. Let x be an element of X. Then the equivalence 
class [t]ker under the kernel relation ker f is the coset Nx. 
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PROOF As usual, in proving the equality of the two sets []ker and Na, 
we will show that each is contained in the other. 

First, consider an element y of the equivalence class []ker ¢, so f(x) = f(y). 
Then by the homomorphic property of f, 


1 1 


so yx + is some member n of N = f~l{ey}. As yr~ 
the member na of the coset Na. 
Conversely, consider a member nz of the coset Na, with n in N. Then 


f(na) = f(n)f(@) = ev f(x) = fla), 


whence nx ker f x, and nz lies in [2]ker ¢ by the symmetry of ker f. (] 


=n, we obtain y as 


Proposition 5.15 allows us to recognize the set. Xker ¢ of equivalence classes 
as the set 
X/N={Na|a in X} 


of right cosets of the normal subgroup N = Ker f, and to recognize the 
surjection 
si: X— Xker f} a [tlker f 


(3.12) from the First Isomorphism Theorem as the map 
s:X 3 X/N;rwH No. 


In the next section, it will be shown that each normal subgroup N of a group 
X yields a group structure on its set X/N of cosets. 


5.3 Quotients 
For subsets A and B of a group (X,-,ex), consider the multiplication 
A-B={ab|a in A, b in B} (5.7) 
— compare (4.23). 
PROPOSITION 5.16 (Recognizing subgroups.) 
Let X be a group. 
(a) The multiplication (5.7) is associative. 


(b) A nonempty subset H of X is a subgroup if and only if H. H = H and 
AN= HH; 
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PROOF (a): Let A, B, and C be subsets of X. Then 


A-(B-C)= {a(bc)|a in A, b in B, c in Ch 
={(ab)c)|a in A, b in B,c in C}=(A-B)-C. 


(b): If nonempty H satisfies the two equalities, then x and y in H imply ry~+ 


lies in H-H-!= H-H =H, so H is asubgroup by Proposition 4.43. 

Now suppose that H is a subgroup. Then H.-H C H by the closure under 
multiplication. Conversely, each element h of H can be written as eh in H-H. 
Also H~-! C H since H is closed under inversion. On the other hand, each 
element h of H can be written as the element (h~1)~! of H7!. 


PROPOSITION 5.17 (Cosets of normal subgroups.) 
Let N be a normal subgroup of a group X. Then the set 
X/N={Na|ax in X} 
of right cosets is a group (X/N,-,N) under the multiplication (5.7), with 
(Nx)! = Na! (5.8) 


for x in X. 


PROOF First note that N = xNa~! for any element x of X. Certainly 
tNat|CN 


by the closure property (5.5) of N. Conversely, if n lies in N, then so does 


= one 1 Then. = a tans): St ae = on (ae) liessin 


x !Nz by the closure property (5.5). 
Now we have 
Na-Ny=N-2Na!-.2y=N-Nay= Nay, (5.9) 


so X/N is closed under the associative multiplication of cosets. For any x in 
X, we have N- Na =(N-N)a = Na and 


Na-N=N-aNa1.c=N-Nc=Nz, 
so that N is an identity element. Finally 
N«-Na!=N-aN«'|=N-N=N 


and 
Na\.Nax=WN-(a2")N(a')'=N-N=N, 


so that Na~! is the inverse of the coset Na. (] 
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By Proposition 5.11, group kernels are normal subgroups. The converse is 
now seen to be true: normal subgroups are group kernels. 


COROLLARY 5.18 
Let N be a normal subgroup of a group X. Then there is a homomorphism 


X > X/N;x- Na 


with group kernel N. 


PROOF By (5.9), the map «+ Nz is a group homomorphism. Its group 
kernel is {2 | Nx = NS =N. l] 


DEFINITION 5.19 (Quotient groups.) Let N be a normal subgroup 
of a group X. Then the group 


(X/N,-, N) 


of Proposition 5.17 is called the quotient of X by the normal subgroup N. 


Example 5.20 (Modular arithmetic.) 


Let d be a positive integer. In the group (Z,+,0) of integers under addition, 
the subgroup dZ of multiples of d is normal. The quotient group Z/dZ is the 
set Zmoad, With the addition 


(dZ + a) + (dZ+ 6) =dZ+ (a+b) 
given in (3.21). Inverses are given by the negation 
—(dZ+a)=dZ-a, 


while the identity element is the subgroup dZ. 
In fact, the set Z/dZ carries more structure, the multiplication 


(dZ + a) (dZ +) = dZ + (a:b) 


given in (3.22). Under this multiplication, the set Z/dZ becomes a monoid, 
with identity element dZ+ 1. Furthermore, the set Z/q inherits the group 
and monoid structure from Z/dZ = Zmoaa Via the isomorphism (3.18). (See 
Exercises 5 and 23.) 
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5.4 The First Isomorphism Theorem for Groups 


The results of the preceding sections may be summarized to show how a 
group homomorphism factorizes under a strengthened version of the First 
Isomorphism Theorem for Sets (Theorem 3.11, page 57). 


THEOREM 5.21 (First Isomorphism Theorem for Groups.) 
Let f : (X,-,ex) > (Y,-,ey) be a group homomorphism. 


(a) The group kernel N = f~'{ey} is a normal subgroup of the domain 
group X. 


(b) The image f(X) ts a subgroup of the codomain group Y. 


(c) In the factorization 
f=jobos 


given by the First Isomorphism Theorem for Sets, the surjection s may 
be taken as the surjective homomorphism 


s:X 3 X/N;xw Na 
of Corollary 5.18, the bijection b is the well-defined group isomorphism 
b: X/N = f(X); Ne f(2) 


from the quotient X/N to the image f(X), and the injection j is the 
injective group homomorphism 


J: HX) SY; fe) > f(a) 
of Example 5.8. 


If the domain of the group homomorphism in the First Isomorphism Theo- 
rem is finite, then the bijection b may be used to count the size of the image. 


COROLLARY 5.22 
Let f : X — Y be a group homomorphism with group kernel N and finite 
domain X. Then the size |f(X)| of the image of f is the index 

|X/N| = |X]/|N| 


of the subgroup N of X. 
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Example 5.23 (The special linear group.) 
Consider the group homomorphism 


b 


det : GL(2,R) — R*; E d 


| ea te 


(compare Example 5.7 and Exercise 7). The group kernel is the set SL(2, R) 
of 2 x 2 real matrices of determinant 1. This group SL(2,R) is called the 
(real) special linear group of dimension 2. The First Isomorphism Theorem 
for Groups exhibits the isomorphism 


GL(2, R)/SL(2,R) & R* 


from the quotient group to the group of nonzero real numbers under multi- 
plication. 


An important application of the First Isomorphism Theorem for Groups is 
the classical Chinese Remainder Theorem. 


THEOREM 5.24 (Chinese Remainder Theorem.) 


Let a and b be coprime positive integers. Then there are isomorphisms 
Z/abZ = Z/aZ x Z/bZ 


of sets, groups under addition, and monoids under multiplication. 


PROOF Consider the map 


p:Z/abZ > Z/aZ x Z/bZ; 
abZ + ur (aZ+2,bZ4+ 2). 


It is clearly well-defined, since 
ab|(x—2a’) implies a|(a—2’) and b| (x—2’). 


It is certainly a (semi)group and monoid homomorphism. For an element 
abZ + x of the group kernel Ker p, the representative integer x is a multiple 
of both a and b. Since a and 0 are coprime, their lowest common multiple is 
ab (compare Exercise 43 in Chapter 1). Thus ab | x, and the group kernel 
Ker p is trivial. It follows that the group homomorphism is injective, since 
the classes of the kernel relation ker p are the cosets of the subgroup Ker p. 
Since the domain and codomain have the same finite number of elements, 
Corollary 5.22 shows that the map p is surjective. 
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5.5 The Law of Exponents 


Let x be an element of a monoid (M,-,e). Then for natural numbers n, the 
powers x” are defined recursively by 


=e and ghtt = gg (5.10) 
(compare Section 2.4 for powers in a semigroup or monoid of functions). 


e In the monoid (R,-, 1) of real numbers under multiplication, the notation 
(5.10) agrees with the usual power notation for real numbers <. 


e In the monoid (N,+,0) of natural numbers under addition, the power 
notation x” for a natural number z translates to the multiple notation 
nx. 


Generally, for a (commutative) monoid (M,+,0) written using additive 
notation, the recursive definition (5.10) of powers translates to a recursive 
definition 

Or =0 and (n+ l)a=na+au 


of multiples. 
For an element x of a monoid (M,.-,e), and natural numbers m, n, the Law 

of Exponents 
gee cag gh (5.11) 


may be proved by induction on n (Exercise 28). The Law of Exponents 
underlies the following theorem, which shows the special role played by the 
monoid (N,+,0) of natural numbers under addition, and the number 1 as an 
element of that monoid. 


THEOREM 5.25 (Universality of natural numbers.) 
Let x be an element of a monoid (M,-,e). Then there is a unique monoid 
homomorphism 


Foi) Dae ae (5.12) 
with f(1) =a. 


PROOF The map f of (5.12) is a monoid homomorphism, since 


by definition, and 
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for natural numbers m and n by the Law of Exponents (5.11). Now suppose 
that » : (N,+,0) — (M,-,e) is a monoid homomorphism with y(1) = z. 
For natural numbers n, the equation y(n) = f(n) follows by induction on 
n. Certainly y(0) = e = f(0), since y is a monoid homomorphism. Then if 
p(n) = f(n), we have 


o(n +1) = 9(n)- v(1) = 9(n)-2= f(n)-c=a"-c=a™' = f(n+), 


the first equation holding since y is a semigroup homomorphism. ] 


Now let x be an element of a group (G,-,e). For each positive integer n, 
define 


esl ty. 


The Law of Exponents (5.11) for monoids may then be extended to a Law of 
Exponents 
we aera (5.13) 


for groups, holding for all integers m and n (Exercise 30). The analogue of 
Theorem 5.25 holds (Exercise 31). It points out the special role played by the 
group (Z,+,0) of integers under addition, and the element 1 of that group. 


THEOREM 5.26 (Universality of integers.) 
Suppose that x is an element of a group (G,:,e). Then there is a unique 
group homomorphism 


exp, : (Z, +,0) = (G, "5 ey; nex” (5.14) 


with exp,(1) = «. 


Example 5.27 (Exponentiation.) 
Consider the element e of the group (R*,-,1) of nonzero real numbers under 
multiplication. Then 

exp,(n) = e” 


for each integer n. For the element 2 of R*, we have exp,(n) = 2” for each 
integer n. Thus the uniquely specified group homomorphism (5.14) may be 
considered as an “exponentiation to base x” in the group G. 


In Theorem 5.26, the group kernel Ker(exp,,.) of the homomorphism exp,, 
is a subgroup of the group (Z,+,0) of integers. By Theorem 4.46 (page 81), 
the group Ker(exp,,) is the set of multiples of a natural number d,. 


DEFINITION 5.28 (Cyclic group generated, order of element.) 
Let x be an element of a group (G,-,e). 
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(a) The image (x) of the group homomorphism exp, in (5.14) is called the 
(cyclic) subgroup of G' generated by z. 


(b) If d; =0, the element x is said to be of infinite order. 


(c) If dy, is a positive integer, the element x is said to be of finite order d,. 


Note that part (b) of the First Isomorphism Theorem for Groups, applied 

to the group homomorphism exp, : Z — G, confirms that the image 
ie ii Gta ee Se a) 

of exp,,, the set of all powers of the element 2, really is a subgroup of G. 

Two cases arise: 

e If x has infinite order, the group kernel of exp, is trivial. Thus exp, 
is injective, and the powers 
ne we, = e, gi = x, x, x, are 
of x are all distinct. Part (c) of the First Isomorphism Theorem for Groups, 
applied to the group homomorphism exp, : Z — G, then yields the group 
isomorphism 

b:Z— (x);n a” 

between the infinite cyclic group (x) and the group of integers (Z,+,0) under 
addition. In general, any group C isomorphic to the group of integers is 
described as an infinite cyclic group. 

e If « has finite order d,, the bijection 6 in the First Isomorphism 
Theorem for Groups shows that x has precisely d, distinct powers 

x? =e, x =f, x, z°, sie gear t 

Since the classes of the kernel relation ker(exp,,) are the cosets of the subgroup 
d,Z, two powers x” and x” of x are equal if and only if the difference m—n 
is a multiple of the order d,. In this case, we may also consider the indices n 
in powers x” of x as integers modulo dy. In other words, when x has finite 
order d, the bijection b in the First Isomorphism Theorem for Groups yields 
the group isomorphism b : Z/q — (x);n + x” between the cyclic group (x) 
of size d and the group of integers (Z/a,+,0) modulo d under addition. 

In general, any group Cg isomorphic to the group of integers modulo d under 
addition is described as a cyclic group of finite order d. This is consistent with 
the nomenclature of Example 2.30 (page 38) — compare Exercise 34. 


REMARK 5.29 Let x be an element of a group G. Whether x has finite 
or infinite order, this order is just the size (or cardinality) 


|(x)| (5.15) 


of the cyclic group (x) generated by x. It is convenient to use (5.15) as a 
standard notation for the order of a group element 2. l] 
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5.6 Cayley’s Theorem 


Abstract groups were introduced as a generalization of the concept of a 
group of permutations. It will now be shown that the generalization has not 
strayed too far: 


Every group is isomorphic to a group of permutations. 


To put groups in context, we begin by examining semigroups. 
Let (S,-) be a semigroup. Then for each element s of S, we define the left 
multiplication by s to be the map 


Agi: S > S;nrs-a. (5.16) 


Example 5.30 (Real shifts.) 

Let (R.+) be the semigroup of real numbers under addition. Then for each 
real number r, the left multiplication 4, by r is the shift o, of Example 2.5 
(page 30). 


Example 5.31 (Cycles.) 
Let n be a positive integer. Let (Z/n, +) be the semigroup of integers modulo 
n under addition. Then A, is the cycle 


(012... (n—1)) 


from the cyclic group C,, (Example 2.30, page 38). l] 


PROPOSITION 5.32 

Let (S,-) be a semigroup. Consider the semigroup (S°,0) of all functions 
from the set S to itself, with the operation of function composition. Then the 
map 


As (S,-) = (S%,0); 8 ard Xs 


is a semigroup homomorphism. 


PROOF For elements s, t, and x of S, the associative law yields 
(As 0 Az) (@) = As(Az(@)) = As(E-v) = 8+ (t- a) =(s-t)- v= Age(z). 


Thus the composite map A, 0 Ay is equal to the single map A,.4. Rewriting in 
terms of A, we obtain A(s) o A(t) = A(s-t), showing that A is a semigroup 
homomorphism. 
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Example 5.33 
Let X be a set. Define an operation - on X by 


vyY=y 
for all x, yin X. This operation is associative, since 
t(y-z)=G=2-y=(r-y)-z 


for x, y, z in X. In the semigroup (X,-), we have \, = idx for all x in X. 
The map A of Proposition 5.32 becomes the constant map 


A:aridx 
in this case. (] 


For groups, the collapse observed in Example 5.33 cannot happen. In fact, 
we obtain the desired isomorphism of each abstract group with a group of 
permutations. 


THEOREM 5.34 (Cayley’s Theorem.) 
Let (G,-,e) be a group. 
(a) The semigroup homomorphism 
A: (G,-) > (GF,0);2 4 Ag 
ws injective. 
(b) The image of A is a group of permutations on the set G. 
(c) The abstract group G is isomorphic to the group A(G) of permutations 


of the set G. 


PROOF (a): Suppose A, = A, for elements x and y of G. Then 


t=x-e=dAz(e)=A,(e) =y-e=y. 


(b): For each element x of G, the map A, is invertible, with two-sided inverse 
A,z-1- Indeed, for each element g of G, we have 


Az OAp-1(9) = 2-27" -g=g=idx(g), 
so that A,0A,-1 = idx. Considering x~! in place of x yields A,-10Az = idx. 


(c): The map (G,-,e) > (A(G),°,idx);2 + Az is a bijective semigroup 
homomorphism between groups. As such, it is a group isomorphism. l] 
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Example 5.35 (Position vectors and translation vectors.) 

Consider the group R? of 2-dimensional real vectors (Example 4.32, page 75). 
An element (a1, 72) of R? represents a position vector. Its image X(.,,29) in 
A(R?) under the isomorphism of Theorem 5.34(c) becomes the corresponding 
translation vector 


Nees eo : IR? > R?; (a1, a2) (21 + a4, £2 + a2) 


(Figure 5.1). Thus according to Cayley’s Theorem, the group of position 
vectors under addition is isomorphic to the group of translation vectors under 
addition. 


translation 
vector 


X(@1,02) 


position 
vector 
(a4 ry az) 


position 
vector 
(x1 + a1, £2 + a2) 


position 
vector 
(x 1> £2) 


FIGURE 5.1: Position vectors and translation vectors. 
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Exercises 


1. Define 6: P(1) — P(2); X + X (compare Example 4.11 and Exercise 6 


7. 


8. 


in Chapter 4). 


(a) Show that 0: (P(1),9) — (P(2),/N) is asemigroup homomorphism. 


(b) Show that @ is not a monoid homomorphism. 


. Show that the group (R*,+, (0,0,0,0)) of 4-dimensional real vectors is 


isomorphic to the group 


(+ [09)) 


of real 2 x 2 matrices under addition. 


. Show that the group Up of shifts on R (compare Example 2.28, page 


37) is isomorphic to the group (R,+,0) of real numbers under addition. 


. Let f : G— H and g: H — K be semigroup homomorphisms. Show 


that the composite go f : G— K is also a semigroup homomorphism. 


. Let 0: X — Y be a bijection from the set X to the set Y. 


(a) If X carries a semigroup structure (X,0o), show that there is a 
unique way of defining an associative multiplication « on Y to yield 
a semigroup isomorphism @ : (X,0) — (Y,*). 

(b) Suppose further that (X,o,e) is a monoid. Show that (Y, *, 0(e)) 
is a monoid. 

(c) Suppose further that (X,0,e) is a group. Show that (Y,*,6(e)) is 
a group. 


In (a), (b), and (c), we say that the set Y inherits the semigroup, 
monoid, or group structure from X via 6. 


. Show that the group ((Z/2)?,+,00) of length 2 bit strings (compare 


Example 4.35, page 78) is isomorphic to the Klein four-group V4. 
For real numbers a, b,c, d, a’, b’,c’, d’, verify the identity 

(ad — be)(a'd' — b’c’) = (aa’ + bc’) (cb! + dd’) — (ab! + bd’) (ca’ + dc’) . 
Define the trace function 


tr: R?2 SR; : “ rat+d (5.17) 


11. 


12. 


13. 


14. 


15. 
16. 


17. 


18. 
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from the group R3 of 2 x 2 real matrices under componentwise addition 
(4.17) to the group of real numbers under addition. Show that the trace 
function is a group homomorphism. 


. Prove Corollary 5.9. 
10. 


Let X and Y be groups. Show that a function f : X — Y is a group 
homomorphism if and only if the graph of f is a subgroup of the direct 
product group X x Y. 


Let n be a positive integer. Show that there is a group homomorphism 
p: Cy, > GL(2,R) 
with 


p((0 12... (n—2) (n—- )’) = ee — sin(2r7/n) 


sin(2rm/n) cos(2ra/n) 


for 1 <r <n. (A homomorphism p from a group G to a group of 
matrices is known as a matrix representation of the group G.) 


Show that there is an injective monoid homomorphism from the monoid 
A of affine functions to the monoid R3 of real 2 x 2 matrices under 
multiplication, given by mapping the function 


f:R-Rarrem-t+e 


fos) 


Show that the injective monoid homomorphism of Exercise 12 restricts 
to a matrix representation p of the group Aff(R) of affine functions 
(compare Exercise 45 in Chapter 2). 


to the matrix 


(a) Show that {(0), (0 1)} is a subgroup of the symmetric group S3. 
(b) Show that {(0), (0 1)} is not a normal subgroup of $3. 


Prove Proposition 5.14. 


Let N be a subgroup of a group G. Let x and y be elements of G. Show 
that the cosets Nx and Ny are equal if and only if y~! is an element 
of N. 


Show that a subgroup N of a group G is normal if and only if each right 
coset Na of N with an element x of G is equal to the left coset «N. 


Show that a subgroup N of a group G is normal if and only if each right 
coset Na of N with an element x of G is equal to the left coset yN with 
some element y of G. 
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19. 


20. 


21. 


22. 


23. 
24. 


25. 
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Give an example of a group G, a subgroup H, and an element x of G 
such that the right coset Hx and left coset xH are distinct subsets of 
G. 


Let M and N be normal subgroups of a group G. Show that MN is 
also a normal subgroup of G. 


For a normal subgroup N of a group G, the equation (5.8) defines the 
inverse (Nax)~+ of the coset Na in the quotient group G/N. Show that 
the equation is also consistent with a different interpretation of the left- 
hand side, namely as the set 


(Nz)~+ ={(nz)~*|n in N} 
of inverses (in G) of elements of the coset Na. 


Let f : (X,-,ex) — (Y,-,ey) be a group homomorphism. Show that 
f is injective if and only if Ker f = {ex}. (Compare Exercise 4 in 
Chapter 3.) 


Verify the claims of Example 5.20. 
(a) Show that the intersection 
SO2(R) = SL(2, R) 7 O2(R) 


is a normal subgroup of the group O2(R) of orthogonal matrices. 
(Compare Study Project 2 in Chapter 4.) 


(b) Show that there is an isomorphism 


Oo(R)/SOo(R) & {+1} 


from the quotient group to the group of real numbers {+1} under 
multiplication. 


(Hint: Use the First Isomorphism Theorem for Groups to obtain both 
(a) and (b) directly.] 


Let M and N be normal subgroups of a group G. 
(a) Show that the map 
06:G—>G/M x G/N;2% (Mz, Na) 


is a group homomorphism. 
(b) Show that the group kernel of 6 is the intersection MM N of the 
normal subgroups M and N. 


(c) Conclude that MN is a normal subgroup of G. 


26. 


27. 


28. 
29. 


30. 


31. 
32. 
33. 


34. 


35. 
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Let G be a group. Consider the subset 


G={(9,9)|9 in G} 
of G x G, the so-called diagonal. 


(a) Show that G is a subgroup of G x G. (Hint: If you have done 
Exercise 10, you may apply it by noting that the diagonal is the 
graph of the group homomorphism idg.) 


(b) Show that G is commutative if and only if G is a normal subgroup 
of Gx G. 


Find a solution x to the simultaneous congruences 
x=2 mod7, 
x=7 mod 10. 
Prove the Law of Exponents (5.11) for an element x of a monoid (M,-,e). 


Show that in additive notation, the Law of Exponents takes the form 
(m+ n)x = max + nx 
of a right distributive law. 


Prove the Law of Exponents (5.13) for an element x of a group (G,-,e). 
(Hint: Beyond the monoid version, the additional cases to verify are 
when one or both of m, n is negative.) 


Prove Theorem 5.26. 
Let d be a positive integer. Show that the order of dZ+ 1 in Z/dZ is d. 


Suppose that m and n are coprime positive integers. Show that there is 
a group isomorphism 
Cnn = Cm x Ch % 


Consider a cycle 
Q@ = (1 £2 3... La) 


of length a in the symmetric group S,,. Show that a@ has order a. 


Let r be a positive integer. Show that the order of the matrix 


cos(27/r) — sin(27/r) 
sin(2m/r) cos(27/r) 


in GL(2,R) is r. 
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36 


37. 


38. 


39. 


40. 


Al. 


42. 


43. 


44. 
45. 
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Let x be an element of a finite group G of size n. Applying Lagrange’s 
Theorem to the subgroup (x), show that the order of x is a divisor of n. 


Let x be an element of finite order in a group G. Show that the order 
of each power of x is a divisor of the order of z. 


Let x be an element of a group G. Show that the order of x~! is the 
order of x. 


Let x be an element of odd finite order in a group G. Show that x? has 
the same order as x. 


In the affine group Aff(R), determine precisely which elements have 
finite order. (Hint: You may wish to use the matrix representation p 
from Exercise 13.) 


Let f : G— H be a group homomorphism with finite domain. For each 
element x of G, show that the order of f(x) is finite, and a divisor of 
the order of x. 


In a group G, consider elements x and y of coprime finite orders a and 
b. Suppose that xy = yx. Show that there is a group isomorphism 


Z/a x Z/y > (xy); (7,8) > a"y?. 
In the symmetric group S;,, consider disjoint cycles a and ( of respective 
lengths a and b, with a and b coprime. Show that ao @ has order ab. 
Show that every monoid is isomorphic to a monoid of functions. 


Let X and Y be isomorphic sets. 


(a) Show that the monoids X* and Y* are isomorphic. (Compare 
Example 2.2, page 29.) 


(b) Show that the symmetric groups X! and Y! are isomorphic. 


Study projects 


1. Error-correcting codes. Storing information for future retrieval, and 


transmitting information from one location to another, are two basic 
tasks of information technology. These tasks are complicated by the 
occurrence of errors. Information storage media are subject to damage 
and deterioration, while communication channels are subject to noise 
and interference. Error-correcting codes are designed to compensate 
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for the effects of errors that are not too serious, enabling the original 
information to be recovered. (They are not directly related to secret 
codes as discussed in Section 2.9.) 


The simplest piece of information is a single yes/no dichotomy, a binary 
digit or bit taking the value 1 or 0 (page 59). It could represent, say, the 
verdict of a jury in a criminal trial: “Guilty” or “Not guilty.” If an error 
occurs, it will change the bit from 1 to 0 or vice versa. The information 
in the bit will be lost completely. To protect the information from errors 
changing a single bit, redundancy is added. The single information bit 
1 is encoded as the bit string 111, while 0 is encoded as 000. The full 
set of encoded messages is the code 


C = {000,111}. (5.18) 


Now if an error changes a bit, say changes 111 to 011, the original 
information bit 1 may be recovered as the commonest bit in the string 
011 (“majority vote”). The price paid for this robustness is a tripling 
in the space needed to record the information (3 bits instead of 1), or 
the time taken to transmit it. 

Groups play a key role in the design of coding schemes. Consider the 
group ((Z/2)?,+,000) of bit strings of length 3 (Example 4.35, page 
78). First, confirm that the code C is a subgroup of this group. Now 
changing the first bit from the right in a bit string of length 3 means 
adding the error 001, thus 


111+ 001 = 110. 
There are three possible single-bit errors: 
e; = 001, eg = 010, e3 = 100, 


labeled by the error location, counting from the right. For completeness, 
the zero “error” is 
€o = 000. 


Adding this “error” means no change to the bit string. The full set of 
errors is 


E = {e9 = 000, e; = 001, e2 =010, e3 = 100}. 


Although E is a subset of the set (Z/2)° of all bit strings of length 3, it 
is not a subgroup of ((Z/2)?,+,000). (Why not?) 
However, there is a bijection 
6 (Z/2)? > E: 
00 €oo0 = 000, 01 F> €01 = 001, 10% ej9 = 010, 11lR ell = 100 
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taking a bit string of length 2, the binary representation of a number 
0 <i < 2?, to the corresponding error e;. According to Exercise 5(c), 
this means that the set F inherits a group structure (£, *,000) making 
@ a group isomorphism. Show that the multiplication table of the error 
set E under * is as given in Figure 5.2. 


* 000 O01 010 100 


000 000 001 010 100 
001 001 000 100 = #010 
010 010 100 000 ~~ O01 
100 100 010 001 000 


FIGURE 5.2: Length 3 error set E under x. 


The code (5.18) is recovered from the set of errors as: 


000 = €o0 + €01 + €00 * €o1 = 000 + 001 + 000 « 001 = 000 
111 = €01 €10 €o1 * €10 = 001+ 010 +001 *010= 111. 


Further, show that: 
(a) A group homomorphism 
s: ((Z/2)°,+,000) — (E, *, 000) (5.19) 


is uniquely defined by e; +> e; for 0 <i <4. [The homomorphism 
(5.19) is known as the syndrome.| 


(b) The code C is the group kernel of the syndrome. For example, 
s(111) = s(e, + eg + e3) = e1 * €2 * e€3 = 000. 


(c) When a bit string x is received, the single error that is assumed to 
have occurred is s(#). For example, if x = 101 is received, the most 
probable single error is s(101) = s(e; + e3) = e, *e3 = eg = O10. 
The word xz = 101 is decoded as x + s(x) = 101+010=111. 


2. Hamming codes. For a positive integer r, define | = 2" —1. The 
integer r is known as the redundancy, and the integer | is known as the 
channel length. The set (Z/2)! of bit strings of length | is known as the 
channel. The errors are defined to be eo, the length / string of zeros, and 
e; for 1 <i < las the length / bit string with zeros everywhere except for 
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a 1 in the 7-th position counting from the right. (This notation extends 
the case r = 2 discussed in Study Project 1.) Usually, we consider the 
indices 0 < 7 < 2” as being expressed in binary notation, as bit strings 
of length r. 


(a) Show that the error set 
E={e,|0<i<2"} 


is not a subgroup of ((Z/2)',+,0), unless r = 1. 


ee 
<2 


Show that F inherits a group structure (F, *, eg) from the bijection 
0: (Z/2)" > Byinv eg. 


(c) Show that e; * e; = ex, where k is the Nim sum of 7 and 7 — 
compare Study Project 1 in Chapter 4. 


(d) Show that a group homomorphism 
8: ((Z/2)',+,e0) > (E,*, €0) (5.20) 


is uniquely defined by e; + e; for 0 < i < l. The homomorphism 
(5.20) is known as the syndrome. 


(e) Show that the group kernel C = Kers has 2'~" elements. The 
subgroup C of the channel ((Z /2)',+; €0) is known as the Hamming 
code of dimension | —r. 


(f) Write out the 16 elements of the Hamming code of dimension 4 in 
the channel of length 7. (Note that these may be used to encode 
hexadecimal digits.) 


(g) Show that the elements of the Hamming code may be obtained 
equally well, either as elements of the group kernel of the syndrome, 
or from expressions such as e; + ej + e; *e; for 1 <i Aj < 2°. 

(h) If the word 1101100 is received in the channel of length 7, which is 
the most likely element of the Hamming code to have been trans- 
mitted? 


(i) If the word 1101101 is received in the channel of length 7, which is 
the most likely element of the Hamming code to have been trans- 
mitted? 


3. Dyadic integers. When we do mathematics, we carry a picture of 
the set Z of integers much like (1.1). Although we normally work with 
relatively small integers, represented by their decimal expansions, we 
do have the capacity to contemplate arbitrarily large numbers, with a 
positive or negative sign. In a computer, the representation of integers 
is different. The representation corresponds to a binary expansion, and 
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FIGURE 5.3: Paths in the binary tree. 


there is a limit on the size of the integers that can be handled directly, 
say 2641. Computer representations of integers are modeled by the set 
Zo of dyadic or 2-adic integers. To emphasize the distinction, members 
of the set Z are described as rational integers. 


Replacing (1.1), the typical picture is displayed in Figure 5.3. The 
starting point is the asterisk at the bottom, the root of the binary tree. 
At the next level up are the two binary digits 0 and 1, forming the set 
Z/2 of integers modulo 2. Above them is the set Z/,4 of integers modulo 
4, with the elements written in their binary expansions: 


Z/o2 = {00,01, 10,11}. 
One level higher is 
Z/o3s = {000,001,..., 111}, 
and so on. Each node in the tree is a bit string b (considering * as the 


empty string or string of length zero). Two arrows emerge from the 
node b, one going to 0b and one to 1b. 
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Now consider a particular integer, say 6. In the set Z/2 of integers 
modulo 2, the integer 6 belongs to the class of even integers, repre- 
sented by the binary digit 0. In Z/22, the integer 6 is represented by 
6 mod 2? = 2, but written in binary notation as 10. In Z/g3, the in- 
teger 6 is represented by 6mod 2° = 110. In Z/s:, the integer 6 is 
represented by 6 mod 24 = 0110. From then on, 6 is represented by 
0...0110. Altogether, the integer 6 is represented by the path 


* > 0— 10 — 110 — 0110 — 00110 > ---— 0...0110—... 


in the binary tree. A general rational integer a is represented by the 
path 


* > amod2 — amod 2? > --- = amod 2” — amod 2"*! =... 


in the binary tree. The set Zp of dyadic integers consists of the full set 
of all paths from the root in the binary tree. In a particular computer 
implementation, the paths have to stop after a certain point, say after 
64 steps. 


(a) Find the path representing the integer 13. 

(b) Find the path representing the integer —1. 

(c) Find the path representing the integer —5. 
) 


(d) Show that the path representing a natural number n eventually has 
each step of the form 


k ZerOes k+1 Zeroes 
—_—_> —_—~_—S> 
00...006— 000...00 b 


for some finite bit string b that is a binary expansion of n. 


(e) Show that the path representing a negative number n eventually 
has each step of the form 


k ones k+1 ones 


oo aie. 
11...11b—-111...116 


for some finite bit string b. What is the relation of the bit string b 
to n? 


(f) Show that the path 
* — 1— 01 — 101 — 0101 — 10101 — 010101 — 1010101 —... 


represents a dyadic integer which is not a rational integer. 
(g) Show that the map 


(Z/or+1,+,0) > (Z/2r,+,0);amod 2"*1 +, amod 2” 


is a well-defined group homomorphism for each positive integer r. 
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(h) 


— 
me 
WN 
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Show that the map 
(Z/or+1,+,1) > (Z/or,-,1);a mod 2"*1 + amod 2” 


is a well-defined monoid homomorphism for each integer r > 0. 


Given two paths 


and 
ok by bo seat by. bpp 


(with ax, by in Z/ox for k > 0), show that there is a well-defined 
sum 


* > ay tby > agtbg 9 +++ 3 ap tbp F Gpy 1 +0p41 2... (5.21) 
and a well-defined product 
* > a+ by > ag: bp 2 +++ SB ay: bp Gp 41: bpp >... (5.22) 


of paths. 


Show that the set Zp of dyadic integers forms an additive group 
under the sum (5.21). 


Show that the set Z2 of dyadic integers forms a monoid under the 
multiplication (5.22). 


Show that the map Z — Zp» taking a rational integer a to the path 
* — amod 2 — amod 2? = --- > amod 2” — amod2"*! >... 


is a group homomorphism for sums and a monoid homomorphism 
for products. 


4. The Euler ¢-function. The Euler ¢-function is defined by setting 
¢(d), for a positive integer d, to be the number 


|(Z/a)*| 


of elements in the group of units of the monoid (Z/q,-,1) of integers 
modulo d under multiplication. 


(a) 
(b) 


Let d be a positive integer. For a nonzero integer a, show that 
amod d is a unit of (Z/a,-,1) if and only if gcd(a,d) = 1. 


For a prime number p, show that ¢(p) = p— 1. 
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(c) For a power p* of a prime number p, with e > 0, show that there 
are p®—! elements of the set 


Z/pe = {0,1,2,...,p°— 1} 


which are multiples of p. Conclude that 
e e e—1 e 1 
o(p°) =p’ — p*" =p fe ; 


(d) Use Theorem 5.6 and the Chinese Remainder Theorem to show 
that 
(Z/mn)” = (Z/m x Z/n)* 


for coprime positive integers m and n. 
(e) Use Theorem 4.33 (page 76) to conclude that 


o(mn) = $(m)9(n) (5.23) 


for coprime positive integers m and n. The property (5.23) is 
sometimes described as the multiplicativity of the Euler ¢-function. 


— 
eH 
Se 


Given a positive integer n, consider its factorization 


_— ll, €2 e 
n= Pp, Po ete Dee 


into a product of powers of distinct prime numbers, as given by the 
Fundamental Theorem of Arithmetic. Show that the value of the 
Euler ¢-function is 


#(n) =n(1-—) (1-=)...(1-2). (5.24) 


For d > 1, let 1 = dj, do, d3,...,ds_1,d, = n be a full list of the 
positive divisors of d. (Compare Figure 1.3 for d = 72.) Show that 


PT 
ithe} 
So 


o(di) + (de) +-+-+ (ds) =d. 


5. Consider the set 


*={ [0m 


of upper triangular matrices, and the division function 


n,m in Z, nzo} 


d:X >R; k | rn-'m (5.25) 


(compare Section 3.3). 
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(a) Show that (X,-,Z2) forms a monoid under the usual matrix multi- 
plication. 


(b 


Ww 


Show that (X,0) forms a monoid under Hadamard multiplication 
(4.18) of matrices. In particular, specify the identity element of 
this monoid. 


Show that the division function 


— 
lo) 
— 


d: (X,0) — (R,-) 


of (5.25) is a monoid homomorphism to the monoid of real numbers 
under multiplication. 


Show that the division function 


ge 
= 


d: (X, -, [2) ce (R, +, 0) 


of (5.25) is a monoid homomorphism to the monoid of real numbers 
under addition. Is it a group homomorphism? 


— 
oO 
Ne 


Apply the First Isomorphism Theorem for Sets (page 57) to the 
division function d of (5.25), obtaining the factorization 


; ra 


‘| Ie 


Xker d an Se Q 


in which the function b is invertible. Use the inverse isomorphism 
(of sets) 


bt : Q =a Xkerd; 
together with the technique of Exercise 5, to create an additive 
group structure (Xkera, +) and a multiplicative monoid structure 


(Xkerd;*) on the set Xyerg of equivalence classes. Show that the 
surjection s yields monoid homomorphisms 


8: (X,-+) > (Xkera; +) 


and 
Ss: (X,°) —s (Xker d; -) : 


In particular, note the indirect confirmation that the well-defined 
multiplication (3.9) and addition (3.10) are associative operations. 
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5.9 Notes 
Section 5.3 


Quotient groups are sometimes described as “factor groups,” particularly 
in the older literature. Unfortunately, this usage leads to confusion with the 
designation of X and Y as the factors in the direct product X x Y. 


Section 5.4 


The Chinese Remainder Theorem is usually attributed to the third century 
mathematician Sun Zi (not to be confused with the author of The Art of 
War). 


Section 5.5 


In contrast with the convention of (5.15), some authors use the notation 
|x| for the order of a group element x. For an element Ng of the quotient 
G/N of a group G by a normal subgroup N, this notation leads to confusion 
between the order |(Ng)| of the quotient group element Ng and the size |Ng| 
of the coset Ng. 


Section 5.6 
A. Cayley was an English mathematician who lived from 1821 to 1895. 


Section 5.8 


R. Hamming was an American mathematician who lived from 1915 to 1998. 
L. Euler was a Swiss mathematician, later moving to Russia, who lived from 
1707 to 1783. 
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Chapter 6 


RINGS 


Many of the sets encountered so far — such as the integers Z, the reals R, or 
the 2 x 2 real matrices R3 — have carried an additive group structure and 
a multiplicative semigroup or monoid structure. These two structures often 
combine to form a richer structure, known as a ring. 


6.1 Rings 


In Definition 6.1 below, a ring is defined as a set with two operations, an 
addition z+ y and a multiplication x- y (or often just xy) of elements x and y 
of R. In compound expressions involving both additions and multiplications, 
the multiplications are to be carried out first (following the convention used 
when working with integers and real numbers). For example, the right-hand 
side of (6.1) below is computed as (a-r) + (y-1r), and not as (r+y)r. We 
say that the multiplication binds more strongly than the addition. 


DEFINITION 6.1 (Distributive laws, unital and nonunital rings.) 
Suppose that a set R carries a (commutative) additive group structure (R,+,0) 
and a multiplicative semigroup structure (R,-). 


(a) The combined structure (R,+,-) is said to satisfy the right distributive 
law if 


(aty)-r=a-r+y-r (6.1) 
for all x, y, rin R. 
(b) The structure (R,+,-) is said to satisfy the left distributive law if 
r-(a+ty)=r-ux4+r-y (6.2) 
for all x, y, rin R. 


(c) The structure (R,+,-) is said to be a (nonunital) ring if it satisfies both 
the right and left distributive laws. 
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(d) A ring (R,+,-) is said to be a (unital) ring if it forms a monoid (R,-,1) 
under multiplication. 


(e) A ring (R,+,-) is said to be commutative if the semigroup (R,-) is 
commutative. 


REMARK 6.2. Note that the group structure (R,+,0) of a ring (R,+,-) is 
always commutative. The issue of commutativity in a ring — Definition 6.1(e) 
— only arises in connection with the semigroup structure (R,-). For a com- 
mutative ring (R,+,-), the left and right distributive laws coincide. In a 
general ring (R,+,-), to say that two elements x and y commute means that 
x-y=y-2x (compare Definition 4.2, page 68). (] 


The identity element 0 of the additive group (R,+,0) of a ring (R,+,-) is 
known as the zero of the ring R. If R is unital, then the identity element 1 of 
the monoid (R,-, 1) is known as the identity or the one of the ring R. Unital 
rings are sometimes described as rings with a one, while nonunital rings are 
called rings without a one. In a unital ring R, the invertible elements — the 
units of the monoid (R,-,1) — are called the units of the ring R. The group 
of units of a unital ring R is written as R*, consistently with the notation of 
Proposition 4.20 (page 72). 


REMARK 6.3 According to Definition 6.1(c), all rings are nonunital, 
regardless of whether they do or do not possess an identity element. When 
a ring R which actually has an identity element is described as “nonunital,” 
the identity element is being disregarded. 


Example 6.4 (Integers.) 
The integers form a unital commutative ring (Z, +,-) under the usual addition 
and multiplication. Note that the right distributive law 


(m+n)r = mr + nr 


reduces to the Law of Exponents in the additive group (Z,+,0) — compare 
Exercise 29 in Chapter 5. 


Example 6.5 (Reals.) 
The set R of real numbers forms a commutative, unital ring (R,+,-) under 
the usual addition and multiplication. ] 


Example 6.6 (Zero rings.) 
Let (A,+,0) be an abelian group (written additively). Define a new, trivial 
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multiplication on the set A by 
roy=O0 


for all z, y in A. Then (A,+,-9) is a nonunital, commutative ring, known 
as the zero ring on the abelian group (A,+,0). Note that the distributive 
laws are satisfied trivially, since each side of the equations (6.1) and (6.2) in 
A reduces to 0. 


Example 6.7 (The trivial ring.) 
The zero ring on the trivial abelian group {0} is unital, with 0 as the identity 
element. It is known as the trivial ring. 


The following example does not exhibit a ring. It shows that having an 
additive group structure and a semigroup structure connected by one of the 
distributive laws is not enough to guarantee that the other distributive law 
will hold (and thus to yield a ring). 


Example 6.8 
Let (A,+,0) be a nontrivial additive group, say with nonzero element a. 
Define the semigroup operation 
tsy=y 
on A, as in Example 5.33 (page 110). Note that the left distributive law (6.1) 
holds trivially, since each side reduces to 
ct+y. 
On the other hand, the right distributive law reduces to 
r=r+r, 


which does not hold for r = a. (] 


Example 6.9 (Integers modulo d.) 

Let d be a positive integer. Then the set Z/dZ or Z/q of integers modulo 
d forms a commutative unital ring (Z/q,+,-) under modular addition and 
multiplication. Using (3.21) and (3.22), the distributive law for Z/q follows 
from the distributive law for the integers (Exercise 2). Note that for d= 1, 
the ring Z/q is the unital zero ring of Example 6.7. 


Example 6.10 (Matrix rings.) 
For a nonunital ring R, let R3 denote the set of 2 x 2 matrices 


T11 712 
T21 T22 
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with entries rj; from R. Then R3 forms an additive (commutative) group 
under the componentwise addition (4.17), and a noncommutative semigroup 
under the usual multiplication (2.7) of matrices. The distributive laws also 
hold (Exercise 3). If the ring R is unital, then so is the corresponding matrix 
ring R3. Its identity element is the matrix 


in which the entries are the zero and identity of the unital ring R. l] 


Example 6.11 (Direct products.) 

Let (R,+,-) and ($,+,-) be nonunital rings. The product group (R x S,+) 
and semigroup (R x S,-) combine to form a ring (R x S,+,-), the (direct) 
product of the rings R and S. Note that the distributive laws in the product 
follow componentwise from the distributive laws in the factors R and S (Ex- 
ercise 5). If R and S are unital, then R x S is unital, with componentwise 
identity element (1,1). 


Example 6.12 (Function rings.) 

Let X be a set, and let ($,+,-) bearing. According to Definition 4.34 (page 
77) and Exercise 23 in Chapter 4, the set S* of all functions f : X — S from 
X to S carries a componentwise additive group structure ($*,+,z) — with 
the constant zero function 


z:X 3 S;rr 0 


— and a componentwise semigroup structure (S*,-). Now for functions 
f,g,h: X — S, the right distributive law in S implies 


[(f +9): h](x) = [f(x) + g(@)] - h(a) 
= f(a) -h(w) + g(a) -h(w) = [f-h+g- h(a) 


for each element x of X. Thus the right distributive law 


(f+tg):h=f-h+g-h 


holds in ($*,+,-). By a similar argument, the left distributive law also holds 
(Exercise 6). The set S* becomes a ring, the X-th power of the ring S$, or 
the ring of S-valued functions on the set X. If S is unital, then so is the 
power S*. Its identity element is the function u: X — S;x+> 1 which takes 
a constant value of the identity in S at each element x of X. For example, 
the set R® of all real functions f : R — R forms a unital ring. 
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6.2 Distributivity 


What is the significance of the distributive laws in a ring (R,+,-)? For an 
element r of R, consider the left multiplication 


Ro Rawvr-sz (6.3) 


by r. The left distributive law (6.2) states that the function (6.3) is a semi- 
group homomorphism from (R, +) to itself. By Proposition 5.5, it follows that 
the left multiplication by r is a group homomorphism from the additive group 
(R,+,0) of R to itself. In other words, the multiplication preserves the zero: 


r-0=0 (6.4) 


and the negation: 


for s in R. Furthermore, 
r-(w-y)=r-a-—r-y (6.5) 
for x and y in R (Exercise 7). In similar fashion, the right multiplication 
R= Rxw2-s (6.6) 


by an element s of R is also a group homomorphism from (R,+,0) to itself. 
Thus 
(-r)-(-s)=r-s (6.7) 


holds for any r and s in R (Exercise 8). Another useful property is the 
equation 
r-0=0=0-r (6.8) 


for any element r of R. Indeed, 
O+r-0=r-0=r-(0+0)=r-04r-0, 


the first two equations holding by the group axioms, and the third by left 
distributivity. Cancellation in the group (R,+,0) then yields 0 = r-0. The 
other equation in (6.8) is proved similarly (Exercise 9). 

In aring (R,-+,-), it is useful to have a so-called sigma notation for repeated 
sums. Let m be an integer. Suppose that x; is an element of R, for integers 
i=m,m+1,m+2,.... By induction on n, define 


L 
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for any integer | < m, and 


n+1 


Yona aunt Poa 


=m 


Thus 
5 
So aj =a ta.+23+24425, 
i=1 


for example. Note that the index 7 in the sigma notation is a bound or dummy 
variable, which may be replaced by any other symbol, e.g., 


Using the sigma notation, we formulate an extension of the distributive laws. 
(The proof by induction is assigned as Exercise 11.) 


PROPOSITION 6.13 (Generalized distributive law.) 


Let x; and y; be elements of a ring R, fori =1,2,.... Then 
(Sta) (Som) =< eas (69 
i=1 j=l i=1 j=l 


for natural numbers m and n. 


COROLLARY 6.14 
Let x and y be elements of a ring (R,+,-). Then for integers m and n, 


(ma) - (ny) = (mn)ay . (6.10) 


PROOF The proof divides naturally into four cases: 
e For m,n > 0, set x; = x and y; = y in Proposition 6.13. 
e Form <0, > 0, set x; = —x# and y; = y in Proposition 6.13. 
e For m>0,n <0, set x; = x and y; = —y in Proposition 6.13. 
e For m,n <0, set x; = —x and y; = —y in Proposition 6.13. 


Note that (6.10) is trivial if m = 0 or n=0. (] 
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6.3. Subrings 


The concept of a subring in a ring combines the concepts of subgroups, 
subsemigroups, and submonoids. 


DEFINITION 6.15 (Unital and nonunital subrings.) 


(a) A subset S of a nonunital ring (R,+,-) is said to be a (nonunital) sub- 
ring of R if S is a subgroup of (R,+,0) and a subsemigroup of (R,-). 


(a) A subset S of a unital ring (R,+,-) is said to be a (unital) subring of 
R if S is a subgroup of (R,+,0) and a submonoid of (R,-,1). 


It is often left implicit as to whether a subring is asserted to be nonunital 
or unital. For example, in any ring R, the subset R itself forms a subring, 
the improper subring. It will be always be a nonunital subring, and will be 
a unital subring if R itself is unital. On the other hand, the trivial ring {0} 
(compare Example 6.6) is a nonunital subring of each ring R. Although the 
trivial ring is unital, it is not a unital subring of any nontrivial unital ring (in 
which 0 # 1). 


Example 6.16 (Subrings of the integers.) 

By Theorem 4.46 (page 81), each subgroup of the additive group (Z,+,0) of 
integers is the set dZ of multiples of some natural number d. Since the divis- 
ibility relation is transitive, each subgroup of (Z,+,0) is also a subsemigroup 
of (Z,-), and hence a nonunital subring of the unital ring of integers. In fact, 
since 1Z = Z, the only unital subring of Z is the improper subring. 


To check that a subset S of a ring R is a (nonunital) subring of R, Propo- 
sition 4.43 and Remark 4.44 (page 80) show that three properties of S have 
to be verified: 


e S is nonempty; 
e cand yin Simply «—yin S; 
e cand yin Simply «-yin S. 


Example 6.17 (The ring R{i|, complex numbers, Gaussian integers.) 
Let R be a unital ring, and let R[i] be the set of 2 x 2 matrices of the form 


F2 


134 Introduction to Abstract Algebra 


for x and y in R. Then R[#] is a unital subring of the ring R3 of all 2 x 2 
matrices over R. Certainly the identity matrix J, lies in R[i], and R[] is 
closed under the (componentwise) subtraction of matrices. Now 


Pelee wallace 


so that R[i] is also closed under multiplication. If R is commutative, it follows 
from (6.11) that R[i] is also commutative. Here are two important special 
cases: 


e The ring R[?] is the ring C of complex numbers (compare Study Project 1); 
e The subring Z/i] of R[é] is known as the ring of Gaussian integers. 


Also, see Example 7.18 (page 163). l] 


Let R bea unital ring, with the identity element (exceptionally) denoted by 
u. The cyclic subgroup (u) of (R,+,0) generated by u, the set of all integral 
multiples nu of u, forms a unital subring of (R,+,-). Indeed for integers m 
and n, Corollary 6.14 yields 


(mu) - (nu) = (mn)(uu) = (mn)u. 


The subring (u) is known as the prime subring of R. If u has infinite order, 
the ring R is said to have characteristic 0. If u has finite order d, the ring R 
is said to have characteristic d. 

Let y be an element of a unital ring (R,+,-). Continuing to write u for the 
identity element of R, and considering a natural number m, Corollary 6.14 
yields 

(mu)x = m(ux) = ma (6.12) 


on setting n = 1 and x = u. Similarly, we get 
x(mu) = m(ru) = mx. (6.13) 
Thus: 
PROPOSITION 6.18 (Characteristic and additive order.) 
Let (R,+,-) be a unital ring. 


(a) Each element of the prime subring of R commutes with each element of 
R. 


(b) If R has finite characteristic d, then in the abelian group (R,+,0), each 
element x has finite order dividing d. 
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In a unital ring R with identity wu and characteristic d, (6.12) and (6.13) 
imply that we may consistently identify the multiple nu with the element 
n-+dZ of the integers modulo d for each natural number n. Thus the prime 
subring of R may be considered as the set {n | n in Z}, with m = n if and 
only if d divides m—n. 


Example 6.19 

Consider the ring Z/2Z x Z/2Z, the direct product of two copies of the ring 
of integers modulo 2. This ring is unital, with identity element (1,1). The 
prime subring is formed by the diagonal subgroup (compare Exercise 26 in 
Chapter 5). The characteristic of Z/2Z x Z/2Z is 2. 


6.4 Ring homomorphisms 


Let (R,+,-) and (S,+,-) be (nonunital) rings. 


DEFINITION 6.20 (Ring homomorphisms, isomorphisms. ) 


(a) A function f : R > S is said to be a (nonunital) ring homomorphism 
if it forms both a group homomorphism f : (R,+,0) — ($,+,0) and a 
semigroup homomorphism f : (R,-) > (S,-). 


(b) If R and S are unital rings, then a ring homomorphism f : R > S 
is described as unital if f(1) = 1, so that f : (R,-,1) > (S,-,1) ts a 


monoid homomorphism. 


(c) A bijective ring homomorphism is called a (ring) isomorphism. The 
notation R= S means that rings R and S are isomorphic. 


The image f(R) of a ring homomorphism f : R — S is a subring of the 


codomain ring S (nonunital or unital, according to whether f is nonunital or 
unital). 


Example 6.21 (Inclusion of subrings.) 
Let S be a (nonunital) subring of a ring R. Then the inclusion map 


J: SCR tH 2 


from S$ to R is a (nonunital) ring homomorphism. If R is unital, and S$ is a 
unital subring, then 7: S — R is a unital rig homomorphism. 
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Example 6.22 (Abelian group homomorphisms.) 
Let (A,+,0) and (B,+,0) be abelian groups. Let f : (A,+,0) — (B,+,0) 
be a group homomorphism. Then there is a (nonunital) ring homomorphism 


bi : (Ayy*0) = (By+;-0); 2 f(x) 


of the corresponding zero rings. (] 


Example 6.23 (Projections and insertions.) 
Let R and S be rings. Then the projections 


m:RxSoR (aya 
and 
T:RxS-S;(a,yroy 


are ring homomorphisms (compare Example 5.4). If R and S are unital, the 
projections are unital ring homomorphisms. The insertions 


4: R—>Rx S;x (2,0) 


and 
2: S—>Rx S;y+ (0,y) 


are also ring homomorphisms. (The notation is quite standard: v is the Greek 
letter “iota.”) However, even if R and S are unital, the insertion 1; is only 
unital if S is trivial, since the identity of R x S is (1,1). 


Example 6.24 (Determinants.) 
Although the determinant function (5.2) is a monoid homomorphism for the 
usual matrix multiplication, it does not form a ring homomorphism, since 


det(A+ B) 4 det A+ det B 


for general matrices A and B. (] 


Example 6.25 (Scalar multiples of the identity matrix.) 
Let R be a unital ring, and let R[t| be the corresponding ring of matrices 
from Example 6.17. Then there is an injective unital ring homomorphism 


e: R= Riij;r k " (6.14) 


Note that for each element x of R, the image e(x) is the scalar multiple ry 
of the identity matrix Iz by the scalar x. It is convenient to identify the ring 
R with its isomorphic image e() under (6.14), so that R becomes a subring 
of the ring R[z]. 
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We conclude this section with a further examination of the relationship 
between unital and nonunital rings. It has already been noted that a unital 
ring becomes nonunital, simply by disregarding the special role of the identity 
element (Remark 6.3). Now suppose that R is a nonunital ring. Write R+Z 
for the direct product set {(r,m)|r in R, m in Z}. with componentwise 
additive group structure. Define a multiplication on R+ Z by 


(r,m) -(s,n) =(rs+ms+nr,mn). (6.15) 


This multiplication is associative, and distributive over the componentwise 
addition on each side (Exercise 25). Now (0,1)-(r,m) = (r,m) = (r,m)- (0,1) 
for rin Rand m in Z, so R+Z becomes a unital ring (R+Z, +,-) with identity 
element (0,1). Consider the map eg : R > R+Z;r + (r,0). This map is 
clearly an injective group homomorphism, and from (6.15) it is also seen to 
be a semigroup homomorphism. Thus f is an injective ring homomorphism. 
A nonunital ring R is often identified with its isomorphic image er(R) under 
er. One may then say that each nonunital ring R embeds into a unital ring 
R+Z. 


6.5 Ideals 
Consider a ring homomorphism f : R — S. The group kernel Ker f or 
J={x in X| f(x) =0} 
has the so-called absorptive property: 
jin J and « in R imply ja, 2 in J. (6.16) 
Indeed, for 7 in J and x in X, 
f(ja) = FU) Fe) =0- f(e) =0 = f(a) -0 = Fla) f(i) = fai). 

DEFINITION 6.26 (Ideal of a ring.) A subset J of a ring (R,+,-) is 
said to be an ideal of R, written J<R, if: 

e It forms a subgroup of (R,+,0); 


e It satisfies the absorptive property (6.16). 


Thus we have the ring-theoretic analogue of Proposition 5.11. 


PROPOSITION 6.27 (Kernels of homomorphisms are ideals.) 
The group kernel Ker f of a ring homomorphism f : R— S forms an ideal 
in the domain ring R. 
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In the context of Proposition 6.27, the group kernel Ker f is described as 
the ring kernel of the ring homomorphism f : R— S. 


Example 6.28 (Ideals of zero rings.) 

Let (A,+,0) be an (additive) abelian group. If J is a subgroup of A, then J 
trivially satisfies the absorptive property (6.16) under the multiplication -9 of 
the zero ring (A,+.-9). Thus J forms an ideal of the zero ring. In fact, J is 
the ring kernel of the ring homomorphism (A, +,-9) — (A/J,+,-0) furnished 
by the group homomorphism 


A> AlJ;nreat+d 
(compare Example 6.22). (] 


Since an ideal J of a ring R is by definition a subgroup of (R,+,0), the 
absorptive property (6.16) specializes (considering x from J rather than from 
anywhere in R) to show that ideals are certainly subrings. By (6.8), it is 
seen that the trivial subring of a ring R forms an ideal of R. The improper 
subring R of R also forms an ideal, the improper ideal. If R is a unital ring, 
the absorptive property (6.16) shows that an ideal containing the identity 
element is improper. 


DEFINITION 6.29 (Simple rings.) A ring R is said to be simple if 
it has no proper, nontrivial ideals. 


Example 6.30 (Rings of prime order are simple.) 

Let R be a ring with a prime number of elements. By Proposition 4.51, the 
additive group (R,+,0) of the ring R has no proper, nontrivial subgroups. 
Thus the ring R has no proper, nontrivial ideals: It is simple. For instance, 
the ring of integers Z/pZ (compare Example 6.9) modulo a prime number 
p forms a simple ring. Similarly, the zero ring (A,+.-9) determined by an 
abelian group with a prime number of elements is also simple. 


In contrast to simple rings, the ring of integers has many ideals. 


Example 6.31 (Ideals in the ring of integers.) 

Each subring of the ring (Z,+,-) is just the set dZ of multiples of a certain 
natural number d (Example 6.16). Since divisibility is a transitive relation, it 
follows that each subring of Z is actually an ideal of Z. Conversely, since each 
ideal is a subring, each ideal is of the form dZ for some natural number d. (] 


The divisibility relation on the set of natural numbers translates nicely to 
the subset relationship on the set of ideals of Z. 
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PROPOSITION 6.32 (Ideals and divisibility.) 
Let c and d be natural numbers. Then d divides c if and only if the ideal cZ 
is a subset of the ideal dZ. 


PROOF § If cis a multiple of d, then c lies in dZ, so the ideal cZ is a subset 
of the ideal dZ. Conversely, if cZ is a subset of dZ, then c- 1 = c belongs to 
the set dZ of multiples of d. 


The concept of an ideal arose in an attempt to extend divisibility properties 
of integers to more general rings. It transpired that the sets of multiples 
were more amenable than the actual elements themselves, so these sets were 
considered as “ideal numbers.” 


6.6 Quotient rings 


If J is an ideal in a ring (R,+,-), then we may form the quotient group 
R/J, the set of cosets « + J of the (normal) subgroup J in the abelian group 
(R,+,0). Now since J is an ideal in R, there is a well-defined semigroup 
multiplication on R/J given by 


(c+ J)(y+J) =aryt+ J 


for xz, y in R. Suppose that the respective cosets x + J and y+ J are also 
represented by ring elements x’ and y’, so that « = 2+) and y’ =y+k for 
elements j and k of J. Then 


vy +I =(e+j\yth) +I =ayt (wht jytjh+J)=ayt J, 


since ck + jy + jk is an element of the ideal J. The distributivity of the 
multiplication over the addition in R/J follows from the distributivity in R 
(Exercise 34). Thus the quotient group R/J actually forms a quotient ring. 
The group homomorphism 


s:RoR/J;reart+J (6.17) 


becomes a ring homomorphism. If R is unital, then so is the quotient R/J, 
with identity element 1+ J. In this case the function (6.17) becomes a unital 
ring homomorphism. 


Example 6.33 (Modular arithmetic.) 

For a positive integer d, the ring Z/dZ of integers modulo d (Example 6.9) is 
the quotient of the ring Z of integers by the ideal dZ consisting of all multiples 
of d. 
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Just as for groups (Section 5.4), a ring homomorphism factorizes under a 
strengthened version of the First Isomorphism Theorem. 


THEOREM 6.34 (First Isomorphism Theorem for Rings.) 
Let f : (R,4+,-) — (S,4,-) be a ring homomorphism. 


(a) The group kernel K = f~*{0} is a ideal in the domain ring R. 
(b) The image f(R) is a subring of the codomain ring S. 


(c) In the factorization 
fHjobos 


given by the First Isomorphism Theorem for Sets, the surjection s may 
be taken as the surjective homomorphism 


s:X 3 X/K;xn2a4+k 
of (6.17), the bijection b is the well-defined ring isomorphism 
b: R/K > f(R)s2+ K+ f(o) 


from the quotient R/K to the image f(R), and the injection j is the 
injective ring homomorphism 


j: f(R) > S; f(z) > f(a) 
of Example 6.21. 


Example 6.35 (Kernels and images of projections and insertions.) 
Let R and S be rings. In Example 6.23, the ring kernel of the surjective 
projection 72: Rx S — S is the image 1;(R) of the ring R under the insertion 
t,. Thus the First Isomorphism Theorem exhibits an isomorphism 


(Rx S8)/u(R) 2S. 


Similarly, (R x S')/t2($) = R (Example 35). l] 


6.7 Polynomial rings 


Let R be a ring. An indeterminate X (over R) is a symbol that is not 
related to any element of R. A polynomial over R in an indeterminate X is 
an expression of the form 


P(X) = py X” + pn-1X”"* + +++ + poX? + pi X + po. (6.18) 
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Here n is a natural number, while p,, pn—1, ...,; P2, Pi, Po are elements 
of R known as the coefficients of the polynomial p(X). Specifically, p; (for 
0 <i <n) is called the coefficient of X* in p(X). The individual summands 
p,X* in (6.18) are called the terms of the polynomial. (The final summand 
po may be written as po X°.) If R is unital, a term 1X‘ may be written just 
as X*. If a coefficient p; happens to be zero, then the term p;X* need not be 
written explicitly. For example, 1X?+1 and X?+2X +1 denote the same 
polynomial if R is the ring of integers modulo 2. Continuing this convention, 
two polynomials, say p(X) as in (6.18) and 


q(X) = qmX™ +--+ +mX +40, (6.19) 
are defined to be equal if 


Pr = 4r, Pr-1 = Ur-1; +++, P2= 42, P1 = 41, Po = Go 


for r = min(m,n) and 
p=9, “=0 


for i > r. (This is the process of equating coefficients.) For example, the 
polynomials 
2X7 +X-3 


and 
OX? + 25X72 +X —3 


are equal if R is the ring Z of integers. The set of all polynomials in X over 
R is written as R[X]. It includes the ring RF itself, as the set of constant 
polynomials po. (Note that the single zero coefficient cannot be omitted from 
the constant polynomial 0.) 

The set R[X] of polynomials in X over R inherits a componentwise additive 
abelian group structure (R[X],+,0) from R. Thus if n = m, the sum of p(X) 
from (6.18) and g(X) from (6.19) is 


(p + q)(X) = Ge Qn) Xx” (Pn—1 t Gnu) Xe. 
++ (po + q2)X? + (pi + 1) X + (po + 40)- 


The negative —p(X) of p(X) is the polynomial 


(—p)(X) = —PnX” — Pn-1X"~* — +++ — poX* — 1X — po 


obtained from p(X) by negating all the coefficients. Using the sigma notation 
introduced in Section 6.2, the polynomial p(X) of (6.18) may be written as 


p(X) = >) p.X* 
1=0 


(admitting 7 = 0 as a possible index). 
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A semigroup structure (R[X],-) will now be defined, in such a way that the 
multiplication - is distributive over the addition in the group (R[X],+,0). To 
specify such a multiplication, it suffices to set 


X*. XI = xX" 
and 
r-X'=rX'=X'-r 


for elements r of R and for natural numbers 7, 7. Thus for the polynomials 
p(X) from (6.18) and g(X) from (6.19), distributivity gives 


(Sp.x') (Lax') = 3 (Sorm)* (6.20) 


Note that coefficients such as pn+m appearing in (6.20) (on the right-hand 
side for k = n+m and j = k), but not in (6.18) or (6.19), are taken to be 
zero. In effect, (6.20) is modeled on the usual product of real polynomials in 
R[X], systematically collecting all the terms involving a specific power X" of 
the indeterminate X. For example, (6.20) gives 
(5X + 1)(7X? + 3X +2) 
=(1-2)+(1-34+5-2)X+(1-74+5-3)X? + (5-7)X? 
= 35X° + 22X? + 13K +2. 


The multiplication may be displayed schematically as follows. 


Te 3X 2 


1 joey 1-3X 1-2 


Equal powers of the indeterminate X are collected along the diagonal stripes, 
corresponding to the right-hand side of (6.20). 
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THEOREM 6.36 

Let R be a ring, and let X be an indeterminate. 
(a) The set R[X] of polynomials over R in an indeterminate X forms a ring 
with componentwise additive group structure, and with the multiplication 
(6.20). 


wa 


(b) The ring (R,+,-) is a subring of (R[X],+,:), namely the subring of 
constant polynomials. 


(c) If R is commutative, then the ring (R[X],+,-) is commutative. 


(d) If R is unital, then the ring (R[X],+,-) is unital, with identity element 
given by the constant polynomial 1. 


PROOF Most of Theorem 6.36 is straightforward. We will just see how 
the associativity of the multiplication (6.20) is verified. To do this, it is helpful 
to rewrite (6.20) in the form 


n+tm 


(don.x') : (Lo) = ( S- igy )X" (6.21) 


h=0 it+j=h 


with the convention that sums like those in the coefficient of X* on the right- 
hand side of (6.21) are understood to be taken over all natural numbers 7 and 
j satisfying the specified condition i+7 = k. Consider the polynomials (6.18), 
(6.19), and 
l 
r(X) = S- ree”. 


k=0 


Using the new convention, we have 


lI 
We 
Ne 
3 
& 
n>_” 
ms 
nee, > 
7 ™~ 
iM- 
> 
Se 
ca 
pp a 


h=0 i+j=h 0 
nt+m-4l 
= | ; ( a paas ra] X* 
g=0 h+k=g i+tj=h 
n+m-+4l 
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nt+tm4+l 


=> [Dal © am)|x 


g=0 ‘ith=g jtk=h 


(Smxt) [Fe (5 an) x] 


h=0 j+k=h 


- (Son). [(Eox)- (Sax) 


k=0 


as verification of the associative law in R[X]. (] 


DEFINITION 6.37 (The ring of polynomials.) The ring (R[X],+,-) 
of Theorem 6.86 is called the polynomial ring or ring of polynomials over R 
in the indeterminate X. 


Example 6.38 (Binomial coefficients.) 
Let n be a natural number. The binomial coefficients are defined as the 
coefficients (") of the polynomial 


(1+ X)" = 53 (") x (6.22) 


in Z[X]. Note that 


for all natural numbers n. Further, 


()- 


for r > n, by the convention allowing additional zero terms in the polynomial 
(6.22). Since 


(Lae re SE aS (6.23) 


by (5.10) in the monoid (Z[X],-,1), we obtain the recurrence relation 


n+1 n n 
= 24 
i) Ge 20) 
for natural numbers r from the definition (6.20) of the multiplication in the 
polynomial ring Z[X], equating coefficients of X'+! on each side of (6.23). [J 
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6.8 Substitution 


In Section 6.7, polynomials were described as formal combinations of powers 
of an indeterminate X. On the other hand, polynomials appear in calculus 
as special kinds of functions. Functions are obtained from polynomials as 
follows. 


THEOREM 6.39 (Substitution Principle.) 
Suppose that 
g:R-S 


is a homomorphism between commutative rings R and S. Then for each fixed 
element c of the ring S, there is a unique homomorphism 


6.: R[X] — S 
with 0.(X) =c. If8@: R— S is a homomorphism of unital rings, then so is 
0.: RIX] > S. 
PROOF Consider a polynomial 
P(X) = prX” + pn-rX”"* +--+ + poX? + pi X + po 
in R[X]. If 0. : R[X] — S is any ring homomorphism with @.(X) = c, then 
8c(p(X)) = O(pn)e” + O(pn—i)e"* + +++ + O(p2)e” + O(pi)e + (po), (6.25) 


so that 0. is unique. Indeed, since 0: R — S is a ring homomorphism, it is 
straightforward to check that the map 


6.: R[X] > S$ 
defined by (6.25) is a ring homomorphism (Exercise 37). (] 


DEFINITION 6.40 (Evaluation, roots.) Let S be a subring of a ring 
R, and let 


J: SCR tHe 
be the inclusion map from S to R (compare Example 6.21). Let p(X) be a 
polynomial in S|X]. 


(a) For each element c of R, the image j-(p(X)) in R is written as p(c), 
and described as the value of the polynomial p(X) at the element c. 
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(b) The function 
R- R;c+ plc) 


is called evaluation of the polynomial p(X), or the polynomial function 
determined by the polynomial p(X). 


(c) The element c of R is said to be a root or zero of the polynomial p(X) 
if p(c) = 0. 


Example 6.41 
Consider the polynomial X* — 2 in Z[X]. Then the real number V2 is a root 
of X?-2inR. 


Example 6.42 
Consider the polynomial X? — 1 in Z/sg[X]. Then 1, 3, 5, and 7 are four 
distinct roots of the quadratic polynomial X? — 1 in Z/s. 


Example 6.43 

Consider the polynomial p(X) = X? +X in Z/2[X]. Then 0 and 1 are roots 
of X? +X in Z/2. In other words, although the polynomial p(X) is nonzero, 
it determines the zero function 


Z/2 = Z/2;0 0, Lr 0 
as its polynomial function. [] 


The final result shows that any particular choice of an indeterminate is 
irrelevant. 


PROPOSITION 6.44 (Indeterminacy of indeterminates.) 
Let R be a commutative ring. Let X and Y be indeterminates over R. Then 
the polynomial rings RX] and R[Y] are isomorphic. 


PROOF Let 7: R<@ R[X] be the inclusion of R as a subring of R[X] — 
compare Theorem 6.36(b). Let k : R — R[Y] be the corresponding inclusion 
in R[Y]. Then the ring homomorphisms 


jy : R[X] — RLY]; p(X) & p(y) 


and 
kx : RIY] > R[X];p(Y) & p(X) 


are mutually inverse. l] 


6.9 


12. 


13. 


14. 
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Exercises 


. Give an example of a set A with an additive group structure (A,+,0) 


and a semigroup structure (A,-) such that the right distributive law is 
satisfied, but not the left. Justify your claims. 


. Let d be a positive integer. Use (3.21) and (3.22), along with the right 


distributive law for (Z,+,-), to give a formal proof of the left distributive 
law in (Z/dZ,+,-). 


. Verify the claims of Example 6.10. 


. Let R be a zero ring (as in Example 6.6). Show that the ring R3 of 2 x 2 


matrices over R is commutative. 


. Verify the claims of Example 6.11. 


. Let X be a set, and let S be a ring. Give a careful proof that the left 


distributive law holds in (S*,+,-). 


. Prove that (6.5) holds in each ring R. 

. Prove that (6.7) holds in each ring R. 

. Prove that the right-hand equation in (6.8) holds in each ring R. 
. Write the left-hand side of (1.8) using sigma notation. 


. Prove Proposition 6.13. 


Show that the right and left distributive laws are both special cases of 
the generalized distributive law. 


Suppose that 


LL= 2X 
for each element x of a ring R. 
(a) Show that 
zr+x2x=0 


for all x in R. 


(b) Show that R is commutative. 


Let S be a subring of a ring R. Show that $3 is a subring of R3. 
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15. 


16. 


17. 


18. 


19. 


20. 
21. 


22. 
23. 
24. 


25. 
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Let X be a subset of a ring R. Show that 
Cr(X)={r in R|rx=ar forall « in X} 


is a subring of R. [The subring Cp(X) is known as the commutant of 
X in R] 


Let S be the set of all functions f : R — R with f(0) = 0. Show that 
S forms a nonunital subring of the function ring R® of all real-valued 
functions. Does it form a unital subring? Does it form a unital ring? 


Consider the subset S = {0,2,4,6,8} of Z/10Z. 
(a) Show that S = {0,2,4,6,8} forms a subring of Z/10Z. 


(b) Show that S forms a unital ring. 
(c) Show that S$ is not a unital subring of Z/10Z. 


Specify the group of units of the monoid of Gaussian integers under 
multiplication. 


Let d be a positive integer. Consider the ring (Z/q)3 of 2 x 2 matrices 
over the ring Z/q of integers modulo d. 


(a) Show that the prime subring of (Z/a)3 is the set 
x0 
Ox 


of diagonal matrices with equal diagonal entries. 
(b) Show that the characteristic of (Z/_)3 is d. 


ein Za} 


What is the prime subring of the ring R2 of 2 x 2 real matrices? 
Suppose d > 1. 


(a) Show that the ring Z/dZ x Z/dZ is unital, with identity element 
(1,1). 
(b) Show that the characteristic of Z/dZ x Z/dZ is d. 
What is the characteristic of the ring Z/4Z x Z/2Z? 
What is the characteristic of the ring Z/3Z x Z/2Z? 


Suppose that x? = for each element x of a unital ring R. Show that 
R has a finite characteristic d that is a divisor of 6. 


Verify that the multiplication (6.15) is associative, and that it distributes 
on each side over the componentwise addition on R + Z. 


26. 
27. 


28. 


29. 


30. 


3l. 


32. 
33. 


34. 


35. 
36. 


37. 
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Let R be a ring. Identify the prime subring of the unital ring R + Z. 
Let r be an element of a unital ring R. Show that 

(r,0) - (1,-1) = (0,0) 
in R+Z. 


Let R be a unital ring having finite characteristic d. Determine the 
characteristic of the unital ring R + Z. 


Let R be a ring. Show that there is a surjective ring homomorphism 
p:R+Z5Z;(r,m)Hm. 
What is the ring kernel Ker p? 


Let x be an element of a commutative, unital ring R. Show that «FR is 
an ideal of R. 


Let x and y be elements of a commutative, unital ring R. Show that 
tR-+ yR is an ideal of R. 


Let a and b be positive integers. Show that aZ + bZ = gcd(a, b)Z. 
Let x be an element of a unital ring R. Show that 
cR+ Rxe+ ReR 
is an ideal of R. Note that ReR means {}7_9 sat; | nin N; s;,t; in R}. 


Let J be an ideal of a ring R. Show that the right and left distributive 
laws hold in R/J. 


Let R and S be rings. Show that (R x S)/12(S) = R. 


Let n, m, and r be natural numbers. Use the definition (6.20) of the 
product in the polynomial ring ZX] to prove the identity 


n+m “fn m 
eye 
for binomial coefficients. 
Complete the proof of Theorem 6.39: Show that the map 
6.: RIX] — S 


defined by (6.25) is a homomorphism of the additive group and semi- 
group (or monoid) structures on R[X] and S. 
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38 


39. 


40. 


Al. 


42. 


43. 


44. 


45. 


46. 
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Determine all the roots of the polynomial X? +1 in the ring Z/s of 
integers modulo 5. 


Determine all the roots of the polynomial X4* — X in the ring Z/4 of 
integers modulo 4. 


Prove the identity 
r 
r=0 


for natural numbers n. [Hint: Evaluate (6.22) at the integer 1.] 


Prove the identity 


r=0 
for positive integers n. 
Prove the identity 
n/2 n/2 
n n 
2b) 2 Gena) 


for even natural numbers n. 


For natural numbers r < n, show that 


(") = eeaat (6.26) 


In particular, conclude that (6.26) gives the number of r-element subsets 
of an n-element set. 


For a prime number p and an integer r with 0 < r < p, show that the 
binomial coefficient 

Pp 

i 


Let R be a commutative, unital ring of prime characteristic p. Show 
that the Frobenius map 


is divisible by p. 


p:R- Rar a? (6.27) 
is a ring homomorphism. 


Show that for each element r of a commutative, unital ring R, there is 
a unique unital ring homomorphism ¢: Z[X] > R with ¢(X) =r. 
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6.10 Study projects 


1. Complex numbers. Consider the commutative ring R[#] of all matrices 


z= ° =| (6.28) 


with real entries « and y (compare Example 6.17, page 133). These 
matrices are known as complex numbers, and the set R[i] is written as 
C. Recall that the ring R of real numbers appears as a subring of C, 
namely as the ring of scalar multiples xIy of the identity matrix Ig by 
real scalars x (Example 6.25, page 136). There is an additive group 
isomorphism 


C > R?; ° | ++ (x,y) (6.29) 


from the set of complex numbers z to the real plane R?. Each complex 
number (6.28) is often identified with its image (x, y) in the plane. 


Define the modulus of the complex number z as the square root 


1 
= 2: 
|z| = { det k | \ =4fa" ty? 
of the (nonnegative) determinant. In the plane representation (6.29), 
the modulus of z is the distance of the point (x,y) from the origin 
(0,0). Write the transpose of the matrix z as the complex conjugate 


In the plane representation (6.29), complex conjugation corresponds to 
reflection (x, y) +> (a, —y) in the x-axis. 
(a) Show that 


[zeae a 


(b) Show that 
z=0 ifandonlyif |z|=0. 


(c) Show that the set 
SS {2 t= ai} (6.30) 


of complex numbers of unit modulus is the special orthogonal group 
SO2(R) — compare Study Project 2 in Chapter 4. Note that in 
the plane representation (6.29), the set (6.30) is the unit circle, the 
circle of points at radius 1 from the origin. 
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(d) Given a second element 


NS 


WH 


wa 


of C, show that 

|z- wl] = |2|-|w. 
{Hint: Use Example 5.7 (page 97) and Exercise 7 of Chapter 5.] 
Rewrite the multiplication formula (6.11) for the product z-w 
of two complex numbers z and w using the plane representations 
(x,y) and (u,v). 
(Note: Although this format is often used for the definition of the 
product of complex numbers, it gives no hint as to why the product 
should be associative. On the other hand, the definition of the 
complex numbers as a subring of the ring of real 2 x 2 matrices 
removes the need for any extra verification. It also enables the 
modulus and complex conjugation to be reduced to the standard 
matrix concepts of determinant and transpose.) 


For the complex number 


show that i? = —1 (recalling that the identity 1 of C is the matrix 
Ip). 
Show that each complex number can be written in the form 


e-y| 
b i) aeaetiv. (6.31) 


Show that each nonzero complex number z is invertible, and that 
its inverse is given by 


le eae, (6.32) 


For a nonzero complex number z, show that the complex number 
|z|~1z is an element 

cos 8 — sin @ 

i @ cosé 


of the unit circle S?. 


Show that each nonzero complex number z can be written in the 
polar form 
z =r(cos@ + isin@) 


with real r = |z| > 0 and 0 < 0 < 2z. 
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2. Check digits. The rings Z/q of integers modulo a positive number 
d are often used to mitigate the effect of errors in the recording and 
reporting of long sequences of digits. For example, consider a 13-digit 
EAN barcode, as illustrated in Figure 6.1. 


BME OVO 9.9) O04 196.92: 0 


FIGURE 6.1: A (simulated) 13-digit EAN barcode. 


Here, it is actually the sequence 


a12 ] Q11 G10 .--. a6 ! a5... a2 aj ! (6.33) 
of the left-most 12 decimal digits which carries the information (type of 
product, manufacturer, etc.). The final digit ag is known as the check 


digit. It is chosen so that the equation 
1-ajg2+3-a1,+1-a19 +---+1-a2+3-a,+1-a9 =0 (6.34) 
is satisfied in the ring Z/19 of integers modulo 10. 


(a) Confirm whether the condition (6.34) is satisfied by the example 
illustrated in Figure 6.1. 


(b) Confirm whether the condition (6.34) is satisfied by the 13-digit 
EAN barcode on this book. 


(c) Show that for each 12-digit sequence (6.33) of decimal digits, there 
is a unique check digit a9 that may be added to ensure the satis- 
faction of (6.34). 


(d) Suppose that a full 13-digit sequence 


a12 Q11 G19... a6 ! a5 ... a2 a1 ao 
satisfying (6.34) was placed on a product, but that a certain digit 
ap (with 12 > k > 0) becomes illegible. Show that the value of a, 
may be recovered from the 12 remaining legible digits. (Hint: In 


the ring Z/10, the element 3 is a unit.) 


154 


(e) 


— 
las) 
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— 
ithe} 
Ree, 
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Explain why the error-correction capability described in (d) above 
is weaker than that given by the error-correcting codes introduced 
in Section 5.8. 


Suppose that a full 13-digit sequence 
I 


a42 411 4190 --- a6 I a5... G2 aj ag I 


satisfying (6.34) is being reported manually, but a keyboard error 
transposes two (distinct) adjacent digits a,41 ax (with 12 > k > 0), 
so that they appear in the order a, az+1 instead. Show that a check 
of the validity of (6.34) for the erroneous sequence 


— 
Q12 .-- Ak+2 Ak Ak4+1 Ak-1 -.--. GO 


will indicate that an error occurred, unless |ay41 — ax| = 5. 
The International Standard Book Number (ISBN) is a 10-digit code 


aj90 4g... @3 42 - ay, (6.35) 


in which the final digit a; is a check digit. The 9 information digits 
ai0,---,@2 are actually decimal, but they are interpreted as if they 
were undecimal (base 11). Working in the ring of integers Z/11 
modulo 11, the ISBN (6.35) is required to satify the condition 


Sok ax =0. (6.36) 


Show that for each sequence 
aj190 ag ... a3 a2 


of information digits, there is a unique element a; of Z/11 that 
may be appended to ensure that (6.36) is satisfied. (The Roman 
numeral X is used as a single undecimal digit for 10.) 

Confirm that ISBN 981-02-4942-X satisfies the condition (6.36). 
Confirm that the ISBN of this book satisfies the condition (6.36). 
Compute the full ISBN when the sequence of information digits is 
83-89656-20. 

Show that if a single digit in a valid ISBN is illegible, it can be 
recovered uniquely from the remaining 9 legible digits. 


Investigate the ability of ISBN encoding to recognize the occurrence 
of a transposition error, as discussed in (e) above for the EAN code. 
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3. Board games. Certain board games are played with two dice. The 
board is marked with squares. When it is their turn, players move their 
markers forward between 2 and 12 steps, according to the total of the 
two numbers between 1 and 6 shown on each die. 


FIGURE 6.2: Part of a board game. 


During the course of the game, players frequently end up with their 
markers on the square labeled 0 in Figure 6.2. To help win the game, it 
is useful to be able to predict which of the squares 2 through 12 these 
players will reach after their next move. 
Assuming that a die is fair, each of the six possible numbers 1, 2, ..., 
6 on its faces is equally likely. Thus the chance of throwing any given 
number is z However, when two dice are thrown, the possible totals 2, 
3, ..., 12 are not equally likely. The chances of throwing a given total 
may be computed within the ring R[X] of real polynomials. 
(a) Let 
1 1 1 1 1 1 
X) = po(X) = =-X°4+ -X° 4 2X44 X94 2X74 -X 
Pi(X) = pa(X) = EX" + EXO + EX + EX + EX + EX, 


so the chance of throwing r with die i (for 7 = 1,2) is given by the 
coefficient of X" in the polynomial p;(X). 


Show that the chance of throwing a total of t with the two dice is 
given by the coefficient of X* in the product polynomial 


p(X) - p(X). 


Compute the product pi(X) - p2(X). 


(d) If a player starts with their marker on square 0 in Figure 6.2, on 
which square are they most likely to land at their next turn? 


rice 
ay 


—> 
lo) 
— 


(e) If a player starts with their marker on square 0 in Figure 6.2, on 
which squares are they least likely to land at their next turn? 

(f) By what factor are they more likely to land on the most likely 
square, as opposed to one of the least likely squares? 
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4. Finite support. Let X be a set, and let (R,+,-) be a ring. Consider 

a function 
f:X OR. 
The set 
supp f = {x in X | f(x) 4 0} 

of elements of X at which the function f takes a nonzero value is called 
the support of f. The function f : X — R is said to have finite support 
if the set supp f is finite. Let f : X — Rand g: X — R be functions 
with finite support. 


(a) Show that the zero constant function has empty support. 

(b) Show that supp (f—g) is a subset of the union of supp f and supp g. 

(c) Show that supp (f - g) is a subset of the intersection of supp f and 
supp g. 

(d) Conclude that the set RX of functions f : X — R with finite 


support forms a nonunital subring of the ring of R-valued functions 
on X. 


If X is infinite and R is unital, show that the set Ro of functions 
f : X — R with finite support does not form a unital subring of 
the ring of R-valued functions on X. 


— 
o>) 
NW 


6.11 Notes 
Section 6.1 


Many authors do not distinguish explicitly between unital and nonunital 
rings, leaving the determination to the context. 


Section 6.3 


C.F. Gauss was a German mathematician and physicist who lived from 
1777 to 1855. 


Section 6.9 


F.G. Frobenius was a German mathematician who lived from 1849 to 1917. 


Chapter 7 


FIELDS 


Amongst all rings, the ring Z of integers and the ring R of real numbers have 
special properties that make them easy to manipulate. For example, nonzero 
integers may be canceled from equations, and nonzero real numbers have 
multiplicative inverses. The special properties are formalized in the concepts 
of “integral domain” and “field.” 


7.1 Integral domains 
In a general ring R, let R* denote the set 
{x in R| x40} 


of nonzero elements of R. The concepts of “integral domain” and “field” 
address the algebraic properties of the set R*. 


DEFINITION 7.1 (Integral domains.) A ring (R,+,-) is said to be 
an integral domain (abbreviated as: ID) if it is both commutative and unital, 
and if the set R* of nonzero elements of R forms a monoid (R*,-,1) under 
the multiplication of the ring. 


Example 7.2 (The ring of integers.) 
The ring Z of integers is the prototypical integral domain. ] 


Example 7.3 (The ring of real numbers.) 
The ring R of real numbers forms an integral domain. 


Example 7.4 (A product ring.) 
Consider the nonzero elements (1, 0) and (0,1) in the direct product Z/2xZ/o. 
Then 

(1,0) - (0,1) = (1-0,0- 1) = (0,0), 


so the commutative, unital ring Z/2 x Z/2 is not an integral domain. l] 
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Example 7.5 (Integers modulo 3.) 
The multiplication table for the set Z/3 of nonzero elements of the ring Z/3 
of integers modulo 3 is as follows: 


1 2 
1 1 2 
2 2 1 
Thus Z/3 forms an integral domain. ] 


Example 7.6 (Integers modulo 4.) 
The multiplication table for the set Z/{ of nonzero elements of the ring Z/4 
of integers modulo 4 is as follows: 


123 
SE ah 3 
2/202 
|| 30° 4 


Note that 2-2 = 0: The set Z/{ of nonzero elements of Z/4 is not closed 
under multiplication. Thus Z/4 does not form an integral domain. l] 


The commutative, unital rings of Examples 7.4 and 7.6 fail to be integral 
domains because of the behavior of special elements: 2 in Z/4, or (1,0) and 
(0,1) in Z/2 @ Z/g. In each case, 0 is a multiple of these nonzero elements. 


DEFINITION 7.7 (Zero divisors.) In a ring (R,+,-), a given element 
r is said to be a zero divisor if: 


(a) r is nonzero; 
(b) There is a nonzero element s of R with 


r-s=0 or s-r=0. (7.1) 
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Integral domains are then characterized amongst all commutative, unital, 
and nontrivial rings by the absence of zero divisors. 


PROPOSITION 7.8 (Integral domains and zero divisors.) 
Let R be a commutative, unital, nontrivial ring. The following conditions are 
equivalent: 


(a) R is an integral domain; 


(b) R has no zero divisors. 


PROOF (a) = (b): Suppose that R is an integral domain. The existence 
of nonzero elements r and s with (7.1) would violate the closure of the monoid 
(R*,-,1) under multiplication. 


(b) = (a): Suppose that R has no zero divisors. Since R is unital and nontriv- 
ial, the set R* contains the identity element 1. The absence of zero divisors 
then guarantees that R* forms a submonoid of (R,-, 1). 


REMARK 7.9 In Proposition 7.8, the hypothesis of nontriviality is nec- 
essary. The trivial ring R is commutative, unital, and devoid of zero divisors, 
but the empty set R* does not form a monoid. 


In Proposition 4.53 (page 85), it was observed that elements of a group may 
be canceled from equations. In integral domains, nonzero elements may also 
be canceled, even though the multiplicative structure does not form a group. 


PROPOSITION 7.10 (Cancellation in integral domains.) 
Let R be an integral domain, with elements a, bi, bo. Suppose that a is 
nonzero. If 


a:b) =a- bo, (7.2) 
then by = bo. 


PROOF If (7.2) holds, then a-b; — a- bz = 0. The left distributive law 
now yields 


Since a is nonzero, and R# has no zero divisors, the factor b; — bz in (7.3) must 
be zero. Thus b; = bo. 


Recall that integral domains are commutative, by definition. Thus given the 
cancellation from the left in Proposition 7.10, there is no need for a separate 
discussion of cancellation from the right, as in Proposition 4.53(b) for groups 
(page 85). 
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7.2 Degrees 


For a ring R, consider a polynomial 
p(X) = prX” + pn-1X"* + +++ + poX? + piX + po (7.4) 


in the polynomial ring R[X]. Suppose that the coefficient p, is nonzero. 
It is then called the leading coefficient of p(X), and the integer n is called 
the degree deg p or deg p(X) of the polynomial p(X). In particular, nonzero 
constant polynomials po — elements of the ring R — have degree 0. The 
zero constant polynomial 0 is deemed to have leading coefficient 0, and degree 
—oo. For any natural number n, we have 


—oo+n=-—c =n+ (-co) (7.5) 
and —oco <n, so max(—oo,n) = n. Also —oo + (—00) = —oo. 


The behavior of the degree of polynomials under the algebraic operations 
on R[X] is described as follows. 


PROPOSITION 7.11 (Inequalities for degrees.) 
Let R be a ring. Consider polynomials p(X) and q(X) over R. 


(a) deg (p(X) + q(X)) < max (deg p(X), deg q(X)); 


(b) deg (p(X) - q(X)) < deg p(X) + deg q(X) . 


PROOF If at least one of p(X), q(X) is zero, say p(X) = 0, then (a) 
reduces to the equality deg q(X) = max ( — 00, deg q(X)) = deg q(X), while 
(b) reduces to the equality —co = —oo + deg q(X). Otherwise, suppose that 


p(X) = pypX” + We gO 


and 


q(X) = GmX™ + Nea, Cupmaee Sane 


with p, £0 and qm 4 0. Without loss of generality, suppose n > m — if not, 
interchange the roles of p(X) and q(X). 


(a): Since p(X) + q(X) = (Pn + Gn)X”" + (Pn-1 + Gn-1)X" 1 +... ifn =m, 
and p(X) + q(X) = prX" +... ifn > m, we have 


deg (p(X) + q(X)) <n = max (deg p(X), deg q(X)) , 


proving that (a) holds. 
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(b): Since 
p(X) -q(X) = (prX” + pr-rX" 1 +...) + (amX™ + Gm-1X"* +...) 
Site Pe ie NO ese 8. (08) 
we have 
deg (p(X) - q(X)) < n +m = deg p(X) + deg q(X) 
proving that (b) holds. ( 


Example 7.12 (Strict inequality for degree of sum.) 

Consider the polynomials p(X) = X? +1 and q(X) = —X? + 2X in R[X]. 
Both have degree 2, but deg (p(X) + q(X)) = deg(2X + 1) = 1. Thus the 
inequality in Proposition 7.11(a) may be strict. lJ 


Example 7.13 (Strict inequality for degree of product.) 

Consider the polynomials p(X) = 2X? and q(X) = 2X?4+1 in Z/4[X]. Both 
of the polynomials p(X) and q(X) have degree 2, but deg (p(X) - q(X)) = 
deg(2X*) = 2 < deg p(X) +deg g(X). Thus again in Proposition 7.11(b), the 
inequality may be strict. ll 


For polynomials over integral domains, the inequality in Proposition 7.11(b) 
becomes equality. 


COROLLARY 7.14 (Equality for degree of product over an ID.) 
Let p(X) and q(X) be elements of the ring R[X] of polynomials over an 
integral domain R. Then 


deg (p(X) - q(X)) = deg p(X) + deg q(X). (7.7) 


In particular, the ring of polynomials over an integral domain is itself an 
integral domain. 


PROOF If p(X) or q(X) is zero, it was already seen that Proposition 7.11(b) 
becomes equality. Otherwise, both leading coefficients py of p(X) and qm of 
q(X) are nonzero. Since R is an integral domain, it follows that their product 
PnQm is nonzero. By (7.6), this nonzero product is the leading coefficient of 


p(X) - q(X), so 
deg (p(X): q(X)) =n +m = deg p(X) + deg q(X) 


as required for (7.7). Finally, if p(X) and q(X) are nonzero, their degrees are 
natural numbers. Then by (7.7), the degree deg (p(X) -q(X)) is also a natural 


number, so that p(X) - q(X) 40. (] 
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7.3 Fields 


In the commutative, unital ring R of real numbers, the set R* of nonzero 
elements forms a group (R%*,-,1) under multiplication. In other words, each 
nonzero real number has a multiplicative inverse: The group of units R* of 
the full monoid (R,-,1) is the set R* of nonzero reals. 


DEFINITION 7.15 (Fields, field homomorphisms.) A ring (R,+,-) 
is said to be a field if it is both commutative and unital, and if the set R* of 
nonzero elements of R forms a group (R,-,1) under the multiplication of the 
ring. A field homomorphism is a unital ring homomorphism between fields. 


Note the distinction between integral domains and fields. Let (R,+,-) be 
a commutative, unital ring. For R to be an integral domain, the set R* of 
nonzero elements has to form a monoid under multiplication. For R to be a 
field, R* has to satisfy the stronger requirement of being a group. The ring 
(Z, +,-) of integers is an integral domain, but it is not a field, since 2 does not 
have a multiplicative inverse in the set Z* of nonzero integers. On the other 
hand, 2 does have a multiplicative inverse 2~! in the field R of real numbers. 


Do not confuse additive inverses with multiplicative inverses. 
Each element r of a ring F# has an additive inverse —r in the 
additive group (R,+,0) of the ring (R,+,-), but unless r is 
a nonzero element of a field R, there is no guarantee of a 
multiplicative inverse r~! for r. 


For finite commutative, unital rings, there is no distinction between integral 
domains and fields. 


PROPOSITION 7.16 (Finite integral domains.) 
A finite integral domain is a field. 


PROOF Let R be a finite integral domain. By definition, the set R* 
of nonzero elements of R forms a monoid (R*,-,1) under multiplication. It 
remains to be shown that each nonzero element r of FR is invertible. 
Consider the set rN = {1,r1,r?,r°,...,r™,...,7",... } of powers of r. Since 
is a subset of the finite set R* of nonzero elements of R, there are natural 
numbers m <n such that r™ =r”. This equation may be written in the form 


7 
. (r . pre ere) ? 


Cancellation within the integral domain R (Proposition 7.10) then yields the 
equation 1 = r-r"—™—!, so that r is invertible. 
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Example 7.17 (The fields Z/,.) 

Let p be a prime number. The nontrivial commutative, unital ring Z/, of 
integers modulo p is an integral domain. Indeed, if (a + pZ) - (b+ pZ) = pZ 
for integers a and b, the prime number p divides the product ab. It follows 
that p divides a or p divides b, so that at least one of the factors a + pZ or 
b+ pZ is zero. By Proposition 7.16, it follows that the finite integral domain 
(Z/»,+,-) is a field. [For an alternative argument showing that Z/, is a field, 
see Exercise 10(c) below, and Example 6.30 on page 138.] l] 


Example 7.18 (The field Z/s[‘].) 
Consider the commutative, unital ring Z/3[i] of 2 x 2 matrices 


i | (7.8) 


Y x 


with entries from the field Z/3 (compare Example 6.17, page 133). The ring 
has 9 elements, since there are 3 independent choices from Z/3 for each of the 
entries x and y of a matrix (7.8). 

The determinant of the matrix (7.8) is 27+ y?. Now in Z/3, we have 0? = 0 
and 1? = 2? = 1, so 2? and y? lie in the set {0,1}. Furthermore, the only 
solution of 2? + y2 = 0 is x = y = 0. Thus nonzero matrices (7.8) have 
nonzero determinants. Since the determinant of a product of these matrices 
is the product of their individual determinants, it follows that Z/3[¢] is a finite 
integral domain, and hence is a field. 


Example 7.19 (Complex numbers.) 


The ring C of complex numbers forms a field — compare (6.32) and Study 
Project 1 in Chapter 6. 


Example 7.20 (Bit strings of length 2.) 


In Example 7.6, it was seen that the ring Z/4 of integers modulo 4 did not even 

form an integral domain, let alone a field. Nevertheless, there are fields with 
4 elements. Consider the set (Z/2)? of bit strings of length 2 (Example 4.35, 
page 78). This set forms a group with componentwise addition, an isomorphic 
copy of the Klein 4-group V4. With multiplication * as given in the table of 
Figure 7.1, the set of length 2 bit strings forms a field ((Z/2)?, +, *). Certainly 
the nonzero elements form a 3-element cyclic group under the multiplication. 
It remains to verify the distributivity, to check that the left multiplication 
(6.3) by each element r is an additive group homomorphism. This is trivial 
for r= 00 and r= 01. For r = 10, the left multiplication is an isomorphism 
from the additive group to itself, with left multiplication by 11 as the inverse 
isomorphism (Exercise 12). 


164 Introduction to Abstract Algebra 


FIGURE 7.1: Multiplication table of a 4-element field. 


7.4 Polynomials over fields 


If F is a field, the ring of polynomials FX] over F in the indeterminate X 
has some special properties. Since fields are integral domains, Corollary 7.14 
shows that F'[X] is an integral domain. In fact, F'LX] behaves even more like 
the ring of integers: It admits a division algorithm, much like the Division 
Algorithm for Z (Section 1.4). 


THEOREM 7.21 (Division Algorithm for polynomials over fields.) 
Let F be a field. Let 


A(X) = dyX™ +---+d,X + do 


be a nonzero polynomial (the divisor) in f[X], with leading coefficient dy, # 0. 
Then for each polynomial 


a(X) = a,X" +-+-+a,X + ao 


(a dividend) in FX], there is a unique polymomial q(X) (the quotient) in 
F(X], and a unique polynomial r(X) (the remainder) in f[X], such that 


deg r(X) < deg d(X) (7.9) 


and 


a(X) = d(X)q(X) + r(X). (7.10) 


PROOF = The proof proceeds by induction on the degree n of the dividend. 


Induction Basis: If dega(X) < degd(X), then (7.9) and (7.10) specify 
q(X) = 0 and r(X) = a(X) uniquely. 

Induction Step: Suppose that the Division Algorithm has been established 
for all dividends of degree less than the degree n of the given dividend a(X). 
In particular, suppose that 


n = deg a(X) > deg d(X) =m 
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(since otherwise, the Induction Basis applies). Consider the polynomial 
a(X) = a(X) — andy X"-™d(X). 


The coefficient of X” in a(X) is Gn — anbz'bm = 0, so deg a(X) < deg a(X). 
By the induction hypothesis, there are polynomials y(X) and r(X) with 
deg r(X) < deg d(X) and 
a(X) = d(X)7(X) + r(X). 
Then 
a(X) = d(X) (andy X"-™ + (X)) + 7(X), 
yielding (7.10) as required, with q(X) = a,b71X"—™ + 7(X). 


Uniqueness: Suppose 
a(X) = d(X)qi(X) + ri(X) = d(X)qa(X) + r2(X) 
with deg r1(X), deg ro(X) < deg d(X). Then 
r(X) — r2(X) = d(X) (a2(X) — u(X)), 


as a multiple of d(X) with degree less than deg d(X), must be zero. Thus 
ry(X) = ro(X) and qi (X) = q(X). 


The induction used to prove Theorem 7.21 yields a recursive procedure, 
Long Division, for dividing one polynomial by another in FLX]. As an ex- 
ample, consider the division of 3X° — X?+4 by 2X?+1 in Z/s[X]. The 
calculation is displayed as follows: 


AX 9 

2X?+1) 3x3 — x? +4 
Bue. > ae 

4X24X44 

AX? 4. 

be, 


At the first step, the divisor 2X?+ 1 is multiplied by 4X to obtain 3X°+4X. 
This multiple is subtracted from the dividend 3X?— X?+4 to yield 4X7+X+4. 
Next, the divisor 2X? + 1 is multiplied by 2 to obtain 4X? 42. This multiple 
is subtracted from the intermediate dividend 4X? + X +4 to yield X + 2. 
Since the degree of X +2 is less than the degree of the divisor, the polynomial 
X +2 is the remainder. Thus the equation (7.10) takes the form 


3X3 — X77 44 = (2X? +1)(4X + 2) + (X +2) 


for this example. 
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For linear divisors of the form (X —c), the results of division are particularly 
transparent. 


PROPOSITION 7.22 (Evaluating the remainder.) 
Let f(X) be a polynomial over a field F. Then for an element c of F,, the 
Division Algorithm yields 


F(X) = (X — o)q(X) + fle) (7.11) 


for some quotient q(X). 


PROOF The remainder 
F(X) — (X — e)q(X) (7.12) 


after division by the polynomial (X — c) must have degree less than 1, so it is 
some constant k in F'. The constant is obtained as k = f(c)—(c—c)q(c) = f(e) 
on evaluating (7.12) at the field element c. 


COROLLARY 7.23 (Roots and linear divisors.) 
In F[X], a polynomial f(X) has a field element c as a root if and only if the 
linear polynomial (X —c) divides f(X). 


Example 6.42 (page 146) indicated that the polynomial X? — 1 of degree 
2 in Z/s[X] has (at least) 4 roots in the ring Z/g. The final corollary of the 
Division Algorithm shows that such a surfeit of roots cannot happen with 
nonzero polynomials over fields. 


PROPOSITION 7.24 (When degrees bound the number of roots.) 
Let f(X) be a nonzero polynomial over a field F. Then the number of roots 
of f(X) in F does not exceed deg f(X). 


PROOF The proof proceeds by induction on the degree. The induction 
basis comprises the nonzero constant polynomials, which have degree 1, and 
no roots. For the induction step, consider a polynomial f(X) of degree n > 0, 
and suppose that no polynomial of degree n — 1 has more than n — 1 roots. 
Suppose that f(X) has a root, say c, in F. By Corollary 7.23, 


f(X) = (X — o)a(X) (7.13) 


for some polynomial q(X) of degree n— 1. If bis a root of f(X) distinct from 
c, then evaluation of (7.13) at b gives 0 = f(b) = (b—c)q(b) . Since (b—c) £0, 
it follows that q(b) = 0. By the induction hypothesis, there are at most n — 1 
possible such roots 6 distinct from c. Thus f(X) has at most n roots. ] 
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7.5 Principal ideal domains 


Section 7.4 shows that, like the ring of integers, the ring of polynomials 
FX] over a field F admits a Division Algorithm. It is helpful to formalize 
the similarities between the rings Z and F(X]. 


DEFINITION 7.25 (PID.) Let R be a commutative, unital ring. 


(a) An ideal J of R is described as principal if it is the set dR of multiples 
of a single element d of R. 


(b) The ring R is said to be a principal ideal domain (abbreviated as: PID) 
if it 1s an integral domain in which each ideal is principal. 


Example 7.26 (Integers form a PID.) 
As each ideal of Z has the form dZ for some natural number d (compare 
Example 6.31, page 138), the integral domain Z is a principal ideal domain. (] 


THEOREM 7.27 (A polynomial ring over a field is a PID.) 
The ring F |X] of polynomials over a field F is a principal ideal domain. 


PROOF § Since the field F' is an integral domain, Corollary 7.14 shows that 
FX] is also an integral domain. The trivial ideal 0F'[X] is principal. Consider 
a nontrivial ideal J of F[X], containing a nonzero element n(X). Define the 
subset 


S = {deg f(x) |0 4 f(X) in J} (7.14) 


of the set N of natural numbers. Since S contains deg n(X), it is nonempty. 
The Well-Ordering Principle shows that S contains a least element, say the 
degree deg d(X) of a nonzero element d(X) of J. It will be shown that J is 
the principal ideal d(X) FX]. 

Since d(X) is an element of J, and J has the absorptive property, the set 
d(X)F |X] of multiples of d(X) is a subset of J. Conversely, consider an 
element a(X) of J. It will be shown that a(X) is a multiple of d(X). The 
Division Algorithm yields 


a(X) = d(X)q(X) +7(X) (7.15) 


with degr(X) < degd(X). But r(X), as the difference a(X) — d(X)q(X) of 
the elements a(X) and d(X)q(X) of J, is itself an element of J. Since the 
degree of d(X) is not greater than the degree of any other nonzero element 
of J, it follows that r(X) is zero. Thus (7.15) actually expresses a(X) as a 
multiple of d(X). 
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A polynomial over a unital ring R is said to be monic if its leading coefficient 
is 1. Suppose that J is a nontrivial ideal of the ring F'[X] of polynomials over 
a field F. In the proof of Theorem 7.27, let M denote the set of all nonzero 
polynomials d(X) in J with deg d(X) as the least element of the set S in (7.14). 
Then amongst the polynomials in the set M, just one is monic (Exercise 17). 


DEFINITION 7.28 (Minimal polynomial of a nontrivial ideal.) A 
monic polynomial d(X) is called the minimal polynomial of the ideal d(X) FX]. 


Example 7.29 (Complex numbers.) 
Consider the evaluation homomorphism 


R[X] > C; p(X) & p(t) 


to the field of complex numbers from the ring of polynomials over the reals. 
The polynomial X?+ 1 lies in the kernel ideal, and has minimal degree there. 
(Indeed, Proposition 7.22 shows that the quotient of RLX] by the ideal of 
multiples of a linear polynomial would be isomorphic to R.) Thus the kernel 
ideal is the set (X?+1)R of multiples of the minimal polynomial X? +1. The 
First Isomorphism Theorem yields 


R[X]/(X? + 1)R[X] — C; (7.16) 
p(X) + (X* + 1I)R[X] + pli) 


as an isomorphism of the field of complex numbers with the quotient of the 
ring of real polynomials by the ideal of multiples of X? + 1. 


Example 7.30 (The bit-string field.) 
Consider the evaluation homomorphism 


Z/2|X] — (Z/2)*;p(X) + p(10) 


to the bit-string field of Example 7.20 from the ring of polynomials over the 
field of integers modulo 2. The minimal polynomial of the kernel ideal is 
X*+4+X +1, since 107+ 10+1=11+10+01=0. Thus 


Z/2[X]/(X* + X + 1)Z/2[X] > (Z/2)?; (7.17) 
p(X) + (X* + X + 1)Z/2[X] — p(10) 
is an isomorphism of fields. [| 


In order to work with quotient rings F|X]/J of rings of polynomials over a 
field F’, it is useful to establish a representation of their elements. 
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PROPOSITION 7.31 (Representing elements of F'X]/.J.) 
Let J be a nonzero ideal in the ring of polynomials over a field F. Let d(X) 
be the minimal polynomial of J, with deg d(X) =n. Then the map 


F" = F[X]/J; (7.18) 
(Pn—15Pn—25 ++ +37 1,70) 9 Pn 1X FT n—2X 7 Ho HX HO + I 
is an isomorphism of additive groups. 
PROOF The map (7.18) is certainly a group homomorphism, since both 
F” and F(X] have their additive group structures defined componentwise. 


Now suppose that the n-tuple (Tn—1,Tn—2,---,71,70) lies in the kernel of 
(7.18). Then the polynomial 


r(X) = tyr X" 1 tH rye X 74+ +X +79 (7.19) 


is a multiple of d(X). Since degr(X) < n = degd(X), it follows that the 
polynomial r(X) is zero, so the corresponding n-tuple 


(P53 aves 71,10) 


is also zero. Thus the group homomorphism (7.18) is injective, since its group 
kernel is zero. Finally, consider an arbitrary coset a(X) + J in the quotient 
F|X]/J. Using the Division Algorithm to write 


a(X) = d(X)q(X) + r(X) 
with degr(X) <n, say r(X) written as in (7.19), the coset 
a(X)+ J =r(X)+ I 
appears as the image of the n-tuple (Tn-1, Tn—2,---, 71,70) under (7.18). Thus 


the map (7.18) is surjective. 


Example 7.32 (Complex numbers.) 
With F = R and d(X) = X? +1, the composite of the group isomorphism 
(7.18) with the field isomorphism (7.16) is the map 


R? > C;(ri,r0) Brot rii 
— compare (6.29). (] 
Example 7.33 (The bit-string field.) 
Consider F = Z/ and d(X) = X?+ X +1. Then the composite of the group 


isomorphism (7.18) with the field isomorphism (7.17) reduces to the identity 
map on the set (Z/2)? of pairs or length 2 bit strings. 
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7.6 Irreducible polynomials 


An integer p > 1 was defined to be irreducible if the only expressions of 
p as a product p = ab of positive integers a and b are when a = 1 or b= 1 
(1.30). Example 7.17 then showed that the quotient Z/pZ of the principal 
ideal domain of integers by the ideal of multiples of an irreducible element 
p was actually a field. (Recall that the terms “prime” and “irreducible” are 
synonymous for positive integers, by Proposition 1.11.) This section pursues 
the analogy between the principal ideal domains Z and F|X] for a field F. 
The definition of irreducibility is extended to polynomials, and irreducible 
polynomials are then used to construct new fields. Polynomials which are not 
irreducible (according to Definition 7.34 below) are described as reducible. 
Reducible polynomials are nonconstant polynomials that admit a nontrivial 
factorization. 


DEFINITION 7.34 (Irreducible polynomials.) Let F' be a field. A 
nonconstant polynomial p(X) in F[X] is said to be irreducible (over F’) if 


p(X) = a(X)b(X) implies ( deg a(x) = 0 or deg b(X) =0 ) (7.20) 


for polynomials a(X) and b(X) in F[X]. 


Example 7.35 (The polynomial X? +1 over R and C.) 
The polynomial X?+1 is irreducible over the field R of real numbers. Indeed, 
if X?+1 admitted a nontrivial factorization, the factors would be linear, and 
then X? +1 would have real roots, according to Corollary 7.23. However, 
the square of each real number is nonnegative, so there are no real numbers 
r with r?+1=0. 

On the other hand, the polynomial X? + 1 is reducible over the field of 
complex numbers, admitting the nontrivial factorization 


X?41=X?-? =(X +i)(X -1) 
in the polynomial ring C[X]. (] 


In general, it can be very tricky to decide whether a given polynomial is 
irreducible or not. Fortunately, there is a fairly simple criterion for quadratic 
and cubic polynomials. 


PROPOSITION 7.36 (Irreducibility of quadratics and cubics.) 
Let F be a field, and let p(X) be a polynomial in F(X] with degree 2 or 3. 
Then p(X) is irreducible if and only if it has no roots in F. 
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PROOF If p(X) is reducible and admits a nontrivial factorization, then 
one of the factors is linear, say 


p(X) = (aX + ao) - b(X) 
with ap and (nonzero) a; in F’, and b(X) in FLX]. Then 
p( — ay ao) = (a1 - (—az*ao) + ao) + b(ay*ao) = 0, 


so p(X) has the root —ay'ag in F. 
Conversely, suppose p(X) has a root cin F. Then Corollary 7.23 shows 
that p(X) has a linear factor, and therefore is reducible. 


Example 7.37 (An irreducible quadratic.) 
The polynomial p(X) = X°+4 X +1 is an irreducible element of Z/2[X], since 
p(0) =0+04+1=140and pil) =1+14+1=1F0. ll 


Example 7.38 (Failure of the test for quartics.) 
Proposition 7.36 does not work for polynomials p(X) of degree larger than 
3. For instance, the reducible polynomial 


(X? +1)(X? +1) 
in R[X] has no real roots. (] 


The main use of irreducible polynomials is to bootstrap the construction of 
new fields from a given starter field. 


THEOREM 7.39 (The Bootstrap Theorem.) 

Let F be a field, and let p(X) be an irreducible polynomial in F[|X]. Let J 
be the ideal p(X)F |X] of multiples of p(X) in F[X]. Then the quotient ring 
F|X]/J is a field. 


PROOF § Since F[X] is commutative and unital, so is its quotient F[X]/J. 
It remains to be shown that each nonzero element f(X) + J of the quotient 
actually has an inverse g(X) + J, so that (f(X)+J)-(g(X)+J) =14J. 
Now f(X) + p(X)F[X] nonzero means that 


f(X) is not a multiple of p(X). (7.21) 


Consider the ideal f(X)F[X] + p(X)F[X] of F[X]. By Theorem 7.27, F[X] 
is a principal ideal domain. Thus there is some element a(X) of F[X] with 


f(X)F[X] + p(X) FIX] = aX) FIX]. (7.22) 
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In particular, the element p(X) = f(X)-0+ p(X) - 1 of the left-hand side of 
(7.22), as an element of the right-hand side of (7.22), is some multiple 


P(X) = a(X)0(X) 


of a(X). Since p(X) is irreducible, one of a(X) or b(X) is a nonzero constant. 
If b(X) is the nonzero constant bo, then a(X) = p(X)by'. But we would then 
have f(X) = f(X)-1+ p(X) -0 as an element of 

f(X)F[X] + p(X)F[X] = a(X)F[X] = p(X)bp' FX] = p(X) F(X]. 


This would give f(X) as a multiple of p(X), contradicting (7.21). Thus it 
is actually a(X) which is the nonzero constant, say ag. Now the identity 
polynomial 1 = ag - aj’ = a(X)-ag' is an element of (7.22), so there are 
polynomials g(X) and q(X) with 1 = f(X)g(X) + p(X)q(X) . Then, recalling 
the definition of the multiplication in quotient rings, we have 


(F(X) + J) - (G(X) + J) = f(X)g(X) + J 
= F(X) g(X) + p(X) FIX] = 1+ p(X) FTX] 


as required. J 


Example 7.40 (An 8-element field.) 

By Example 7.37, there is an irreducible polynomial p(X) = X°+X +1 over 
the field Z/2. Theorem 7.39 then shows that a field is given by the quotient 
Z/a{X/J of the polynomial ring Z/2[X] over the ideal J = p(X)Z/2[X] of 
multiples of p(X). By Proposition 7.31, this field has 8 elements, and each 
coset may be represented by a polynomial of degree at most 2. It is instructive 
to compute the successive powers of the coset X + J: 


XP-FaK3+ (X94 X44) + FH X4+14+I; 

X44 J2X-XP4TaHX(X414 I= X74 X45; 

Xe ef aK (XP X) tJ aX Kt IJ aK EX 4147; 
X®4 J aX -(X* 4X41) + IS X37 + X72 4X4 IFS X7 4145; 
X 4 J=2X-(X7414 7 = O44 X4 SJ H14 7. 


Note that the multiplicative group of nonzero elements of the field is cyclic, 
generated by the coset X + J. Thus the table of powers of X + J may be 
used to compute products and inverses in the field, while Proposition 7.31 
indicates how to compute sums. For example, we have 


X° 4X8 = (X27 4X 41)4+ (KX? +1) =X 


and 
(X41)- (X74 X47) 3X8. PSK KT AX, 


using the list of powers (and omitting explicit mention of the ideal J). (] 
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7.7 Lagrange interpolation 


Field structures on a set are very useful in a wide variety of contexts. In this 
section, we discuss one typical application, to the specification of functions. 
Suppose that F is a field. For a natural number n and distinct elements 20, 
Y1,---, Ln of F, it is desired to construct a function f : F — F taking a 
particular function value 

f (ti) = yi 
in F for each 0 < 4 < n (compare Figure 7.2). The function should be easy 
to specify, and easy to compute, even when the set F is too large for a simple 
table look-up to be feasible. 


F 
A 
(71,41) 
ae. (2n, Ym) 
| Hy Moe Sa, ates eed es ++ 
F 
Xo 1 In 


FIGURE 7.2: Specifying a function. 


To design a function with the desired properties, fix an index 0 < 7 < n, and 
consider the simpler problem of finding a function 6,, satsifying the following 


specification: 
1 ifi=j; 
bas(t) =) 0 es 2, 
0 ifif7. 
Such a function is known as a (Kronecker) delta function. It is implemented 
as the polynomial 


(X — Xo) eae mele. a4) ...(X — 2p) 


(aj — 0)... (@j — Bj-1)(@j — Bj41)-.. (Lj — Ln) We) 


5z,(X) = 


of degree n. Note that, since x; is distinct from all the other elements 29, 
ws, Uj-1, Lj41, +--+, Ln, the denominator of (7.23) represents a nonzero field 
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element, which thus has an inverse. Note further that for each index i distinct 
from j, the factor (X — 2;) in the numerator of (7.23) vanishes at x;, so 
Jx,(%i) = 0 in this case. Finally, note that each factor 


(xj — 24) 


(with ¢ # j) takes the value (x; — x;)(x; — aj)~' = 1 at xj, so 6z,(a;) =1, as 
required. 
The function f : fF — F can now be implemented by evaluating the poly- 


nomial 
n 


F(X) = Soy; be,(X), (7.24) 


j=0 


a sum of constant multiples of the polynomials (7.23). The polynomial (7.24) 
is known as the Lagrange interpolant for the specified function values. Note 
that, for each 1 <7 <n, we have 


(xi) = > Yj Ox; (Xi) 
j=0 


a1 
= = ui bx, (Xi) + yi + Sx; (24) se5e Yi ° bx; (xi) 
j=0 


j=itl 
=0+y%-1+0=y% 


as required. The result may be summarized as follows. 


THEOREM 7.41 (Lagrange interpolation.) 

Suppose that x0, 11, ..., tm aren+1 distinct elements of a field F. Then 
for any elements yo, Y1, ---, Yn of F’, there is a polynomial f(X) of degree at 
most n over F’ such that f(a;) = y; forO<i<n. 


Example 7.42 (Interpolants of unexpectedly low degree.) 

A Lagrange interpolant for n + 1 function values may have a degree less 
than n. For instance, the Lagrange interpolant to the identity function on the 
3-element field Z/3 is X (Exercise 23). 


COROLLARY 7.43 (Self-maps of finite fields are polynomials.) 
Let F be a finite field. Then each function f : F — F may be implemented 
as the evaluation of a polynomial f(X) over F. 


The property enunciated in Corollary 7.43 is sometimes expressed by the 
statement that finite fields are polynomially complete. 
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7.8 Fields of fractions 


A final method of constructing new fields, this time infinite, mimics the 
construction of the rationals from the integers (Section 3.3). Let D be an 
integral domain. Consider the set 


nm 
«= 1[0] 
of matrices over D. Note that if D = Z, the top row of a matrix from X 
corresponds to a solid dot in Figure 3.1. 
A relation R is defined on the set X by 


Ny My, ng m2 a a 
k ie R ke | if and only if 


n,m in D, nzo} 


ny my 


eons 0 (7.25) 
— compare (3.6). As in Proposition 3.8, the relation R is seen to be an 
equivalence relation on the set X. (The properties of the ring of integers 
that were used in the proof of Proposition 3.8 are all properties that have 
been abstracted into the concept of an integral domain.) Define an addition 
operation on the set Xp of equivalence classes by 


ny My nz ™M2 Ny My nz ™M2 

oes Fa os Peek Cc a oe 
(using the usual multiplication of matrices). This addition operation is well 
defined (compare Exercise 7 in Chapter 3). Since matrix multiplication is 
associative, it follows directly that the addition on Xp is associative. Since 
X forms a commutative subsemigroup of the semigroup of matrices under 


the usual multiplication, it again follows directly that the addition on Xp is 
commutative. Furthermore, the equivalence class 


10 
fo 
01), 


of the identity matrix is an additive identity for the semigroup (Xp, +), so 
we obtain a commutative monoid (Xpr,+,0). Finally, note that 


Dt Oe) Cond Coo], fo el, = fot 
= . = ie 
07] p De tes On On x OLS | a 
— the latter equality holding directly by (7.25) — so that (Xp,+,0) is seen 
to form an abelian group. 


PROPOSITION 7.44 


There is an injective abelian group homomorphism 


: 1 
jo * (Dy +50) > (AR tt, 0) i : 
01 2 
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PROOF The map 7p is an abelian group homomorphism, since 
1 My, ; 1 mg) 1 m, + mg 
0 1 O11] jo 1 


for elements m1, m2 of D. By (7.25), the group kernel of jp is the set {0} of 
elements m in D with 


=—-mM, 


o=[ 


1 ; 
so jp is injective. (] 


Now define a commutative operation of multiplication on XR by 


my ™y ng m2 = nyng ™1mM~2 
Palomo male 8 


Note that the right-hand side of (7.27) may be written as the equivalence 


class 
Ny My, iS n2 M92 
0 Ny 0 ng R 


of the componentwise or Hadamard product of the representatives of the 
classes on the left-hand side of (7.27) — compare (4.18). The operation (7.27) 
is well defined, as shown by the argument on page 55. Since the Hadamard 
multiplication of matrices is associative, Xp forms a commutative semigroup 
under the multiplication (7.27). The equivalence class 


ia 
t= fo 
sli] Gs 


forms an identity element for the multiplication, yielding a monoid (Xp,-, 1). 
Moreover, the map jp : (D,:,1) — (Xpr,-,1) of Proposition 7.44 is a monoid 
homomorphism. Finally, note that 


le ond. CO nd bo, 


— the latter equality holding directly by (7.25) — for each nonzero element 


nm 

On} p 
of Xp. Thus the set of nonzero elements of Xp forms a commutative group 
under multiplication. A direct verification shows that the multiplication in 


Xp distributes over the addition (Exercise 26). Altogether, we obtain a field 
(Xpr,+4+,-), and jp : D— Xp becomes a unital ring homomorphism. 
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Let D be an integral domain. The field Xp is called the field of fractions F'p 
of the integral domain D. Each equivalence class 


tae 


is written in the forms 


ae es n-'m = m/n 
n 
The addition becomes 
a " Cx ad + be 
b d bd 
The multiplication becomes 
a@ c_ ac 
b d bd 


The inversion of nonzero elements becomes 


oe. 
b G2 
Then the map jp takes the form 

jp: D— Fp;me =. 


The integral domain D is usually identified with its image under the injective 
map jp. 


Example 7.46 (The field of fractions of the integers.) 
The field of fractions of Z is the field Q of rational numbers. (] 


Example 7.47 (Rational functions.) 

Let D be an integral domain, and let X be an indeterminate over D. By 
Corollary 7.14, the ring D[X] of polynomials in X over D forms an integral 
domain. The field of fractions of the integral domain D[X] is called the field 
D(X) of rational functions in X over D. Thus an element of D(X) is of the 


form 
F(X) 


g(X) 
in which f(X) and g(X) are polynomials over D, the polynomial g(X) being 
nonzero. Such elements are known as rational functions over D. [| 
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Exercises 


1. Show that the ring Z/s of integers modulo 5 forms an integral domain. 


10. 


. Show that the ring Z/¢ of integers modulo 6 does not form an integral 


domain. 


. Let n be a prime number. Show that the ring Z/,, of integers modulo n 


forms an integral domain. 


. Let n be acomposite number. Show that the ring Z/, of integers modulo 


n does not form an integral domain. 


. Show that the ring Z/i] of Gaussian integers forms an integral domain. 


. Let (R,+,-) be a commutative ring. Let Z be the subset of R consisting 


of 0 and the zero divisors. Show that (Z,-) is a subsemigroup of (R,-). 


(a) Show that with the definition (7.5), the union 
N_owo = {-cof UN 
of the singleton {—oo} with the set of natural numbers forms a 
commutative semigroup under addition. 
(b) Is N_.~ a monoid? 
(c) Setting 
2°" =0 


? 


show that 
(Neos) Pri: (N, ‘); dw of 


is a semigroup homomorphism. 


. What is the inverse of the nonzero element 25 in the field Z/41? 


. Consider the subset 


S = {0,2,4,6,8} 
of Z/10Z (compare Exercise 17 in Chapter 6). Show that S forms a 
field. 
Let R be a simple, unital commutative ring. 
(a) Suppose that a is a nonzero element of R. Show that the set 
aR = {ar|r in R} of multiples of a is a nontrivial ideal of R. 
(b) Show that each nonzero element of R is invertible. 


(c) Conclude that a simple, unital commutative ring is a field. 
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(d) Give an example of an integral domain which is not simple. 


(e) Give an example of a simple, commutative ring which is not a field. 


11. Show that the ring Z/5[?] has a zero divisor. 


12. Consider the group ((Z/2)?,+,00) of length 2 bit strings discussed in 
Example 7.20, with the multiplication « displayed in Figure 7.1. 


(a) Show that the left multiplication by 11 (using *) is inverse to the 
left multiplication by 10. 


(b) Show that the left multiplication by 10 gives a homomorphism from 
the additive group ((Z/2)?,+, 00) to itself. 


13. In the ring Z/3[X] of polynomials over the field Z/3 of integers modulo 3, 
find the quotient and remainder when the polynomial X*— 1 is divided 
by X°4+ X +1. 


14. Let f(X) be a polynomial over a field F’. For a general linear polynomial 
a,X + ao (with a, # 0) over the field F’, show that the remainder left 
after dividing f(X) by a1X + ao is f(—az‘ao). 


15. (Fermat’s Little Theorem.) Let p be a prime number. 


(a) Show that each nonzero element of the field Z/, is a root of the 
polynomial X?~' — 1 over Z/,. (Hint: Apply Exercise 36 from 
Chapter 5 to the group G of nonzero elements of Z/,.) 

(b) Show that each element of the field Z/, is a root of the polynomial 
X? — X over Z/y. 


16. Consider the polynomial p(X) = X" — 1, for a positive integer r. 


(a) Show that p(X) has r roots in the field of complex numbers. (Hint: 
Compare Exercise 35 in Chapter 5.) 
(b) Show that p(X) may have less than r roots in the field of real 


numbers. 


(c) For which positive integers r does p(X) have r roots in the field of 
real numbers? 


17. Let J bean ideal in the ring FLX] of polynomials over a field F’. Consider 
the set M of nonzero polynomials in J whose degree is minimal in the 
set S' of (7.14). 


(a) Given two polynomials in M, show that each is a multiple of the 
other. 


(b) Given two polynomials in M, show that each is a constant multiple 
of the other. 
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(c) Given a polynomial in M, show that each of its nonzero constant 
multiples also lies in M. 


(d) Show that the set M contains a unique monic polynomial. 


18. Consider the elements 2 and X in the ring Z[X] of polynomials over the 
integers. 


(a) Show that 2Z[X] + XZ[X] is an ideal of Z[X]. 
(b) Show that there can be no polynomial d(X) in Z[X] for which 
2Z|X] + XZ[X] = d(X)Z[X]. 


(c) Conclude that Theorem 7.27 cannot be generalized, replacing the 
field F' with an integral domain. 


19. Let R be the set of all the 2 x 2 matrices of the form 
x 3y 
y « 

with x and y from the ring Z/s of integers modulo 5. 


(a) Show that R is a subring of the ring (Z/5)3 of all 2 x 2 matrices 
over Z/5. 


(b) Show that R is a field. 


(c) Find the minimal polynomial of the kernel ideal of the evaluation 
homomorphism 


Z/5|X] > R; p(X) + p( f 4 ) 


20. Show that the polynomial X?+4 X+1 is an irreducible element of Z/7[X]. 


21. Let p(X) be a polynomial over a field F’, and let c be an element of F. 
Show that p(X) is irreducible if and only if p(X + c) is irreducible. 


22. Consider the polynomial p(X) = X? + 1 over the field Z/3 of integers 
modulo 3. Let J be the ideal p(X)Z/3[X] of multiples of p(X). 


(a) Show that p(X) is irreducible over Z/3[X]. 
(b) Compute the powers of the coset X + J in the field Z/3[X] / J. 


(c) Find the unique representative r(X) of the coset X3+X°+ J with 
deg r(X) < 2. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


3l. 
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(d) Compute the product 
LPP HOXPTES) 
in the field Z/3[X] / J. 


(e) Show that Z/3[X] / J is isomorphic to the field Z/3[#] discussed in 
Example 7.18. 


Verify that the Lagrange interpolant 
0» do(X) + 1+ d1(X) + (-1) - 6-1(X) 


to the identity function id: Z/3 — Z/3 on the 3-element field of integers 
modulo 3 is X (compare Example 7.42). 


Show that the Lagrange interpolant passing through two specified points 
(xo, yo) and (#1, 1), with zo 4 21, is 
1 Zo Yo 
/X : 
ac fn yo) rh 


Show that the infinite field of real numbers is not polynomially complete: 
Give an example of a function f : R — R which cannot be implemented 
as the evaluation of a polynomial p(X) over R. Justify your claim. 


Let D be an integral domain. Show that the multiplication (7.27) in 
Xp distributes over the addition (7.26). 


Suppose that an integral domain D is actually a field. Show that the 
embedding jp : D — Fp of D into its field of fractions is an isomorphism 
of fields. 


Show that the field of fractions of the integral domain Z/i] of Gaussian 
integers (compare Exercise 5) is Q{i]. 


Let D be an integral domain, and let 6: D — K be an injective unital 
ring homomorphism from D to a field K. Show that there is a unique 
field homomorphism Fg : Fp — K such that Fg o jp = 0. 


Let D be an integral domain, and let X be an indeterminate over D. 
Show that the field of fractions Fp of D is a subfield of the field D(X) 
of rational functions in X over D. 


Let D be an integral domain, and let X be an indeterminate over D. 


Show that 5 
XT — 1 
=X"4 X14 ..4 X41 
xX —1 


for each natural number n. 
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7.10 Study projects 
1. Permutation polynomials. Let F be a finite field. A polynomial 
p(X) in F[X] is a permutation polynomial if its polynomial function 
F = F;e+ plo) 


— compare Definition 6.40(b) — is a permutation of the finite set F’. 


(a) Show that a polynomial of degree 1 in FX] is a permutation poly- 
nomial. 


(b) Show that the symmetric group 53 is the full set of polynomial 
functions of degree 1 polynomials in Z/3[X]. 


(c) Show that the degree 1 polynomial functions over the field Z/s 
form a proper subgroup of the symmetric group S55. 


(d) Show that the degree 1 polynomial functions, together with the 
degree 3 permutation polynomial functions, combine to give the 
full symmetric group S5. 


(e) Give an example of a degree 3 polynomial over the field Z/5 which 
is not a permutation polynomial. 


2. Simpson’s Rule. Suppose that 
f:[/-L1-R 


is a continuous, real-valued function defined on the interval [—1,1] of 
real numbers r with —1 < r < 1. Simpson’s Rule approximates the 


Riemann integral 
1 
[ fou 
-1 


as the integral of the Lagrange interpolant polynomial function p(X) 
that is determined by the specifications 


-1e f(-1), Or f(0), 1h f(1). (7.28) 


(a) Find the quadratic interpolant p(X) specified by (7.28). 


(b) Evaluate the Riemann integral 


[reo (7.29) 


of the quadratic interpolant. 
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(c) Consider the function cos (7a/2) on the interval [—1, 1]. Compute 
the true value of the Riemann integral 


1 
if cos (=) dx, 
-1 2 


and compare it with the value (7.29) given by Simpson’s Rule in 
this case. 


3. Primitive elements. Given a prime number p, and an irreducible 
polynomial p(X) of degree n in the ring Z/,[X] of polynomials over 
the p-element field Z/, , Theorem 7.39 shows that the quotient ring 
Z/,[X]/J by the ideal p(X)Z/p[X] of multiples of p(X) forms a field 
K. Proposition 7.31 shows that this field has p” elements, and it also 
reduces the additive group structure (AK,+,0) to the power (Z/,,+,0)”. 
The problem of computing products and inverses in the field K still 
remains. This problem is solved by the use of primitive elements. 


An element e of a finite field K is said to be primitive if the set (e) of 
powers of e is the full set K* of nonzero elements of the field K. 


(a) Show that X + J is a primitive element of the 8-element field in 
Example 7.40. 


(b) Show that the polynomial p(X) = X? + X +1 is irreducible over 
the field Z/s. 


(c) Define J = (X? + X +1)Z/s5[X] and K = Z/s[X|/J. Show that 
X + J is not a primitive element of K. 


(d) Show that (X +1) + J is not a primitive element of K. 
(e) Show that (X + 2) + J is a primitive element of K. 


4. Discrete logarithms. Given a primitive element e for a finite field K, 
each nonzero element x of the field appears as a unique power e! of e 
with 0 <1 < |K|—1. This power is defined as the discrete logarithm 
log. z of « to the base e. A table of the discrete logarithms to base 
X+J for the 8-element field of Example 7.40 is displayed in Figure 7.3. 
The coefficients of the unique coset representatives of degree less than 
3 are written as bit strings. 


x 001 | 010 | O11 | 100 | 101 | 110 | 111 


log, x 0 1 3 2 6 4 5 


FIGURE 7.3: Discrete logs to base X in Z/2[X]/(X? + X + 1)Z/2[X]. 
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n 0 1 2 3 4 5 6 
e” 001 | 010 | 100 | O11 ) 110 | 111 | 101 


FIGURE 7.4: Discrete antilogs to base X in Z/2[X]/(X? +X +1)Z/2[X]. 


Figure 7.4 displays the corresponding “antilogarithms,” the successive 
powers of the primitive element e. 


Compute similar tables of discrete logarithms and antilogarithms for 
the primitive element e = (X +2) + J in the field K = Z/5[X]/J with 
J = (X?+X 4+1)Z/s[X]. Field elements can be represented as pairs 
ryro of elements of Z/s, the coefficients of the representative polynomial. 


7.11 Notes 
Section 7.1 


Some authors use the absence of zero divisors in nontrivial, commutative 
unital rings to define integral domains (compare Proposition 7.8). However, 
Definition 7.1 is more direct and positive. 


Section 7.7 


L. Kronecker was a German mathematician who lived from 1823 to 1891. 


Section 7.8 


The use of matrices in the construction of the field of fractions helps to 
minimize the number of formal verifications required, leaving only the proof of 
transitivity of the equivalence relation R, and confirmation of the distributive 
law. 


Section 7.9 


P. Fermat was a French mathematician who lived from 1601 to 1665. His 
“Little Theorem” (Exercise 15) is not to be confused with his “Last Theorem,” 
the nonexistence of positive integers x, y, z with «” + y” = z” when n > 2. 


Section 7.10 


T. Simpson was an English mathematician who lived from 1710 to 1761. 


Chapter 8 


FACTORIZATION 


The Fundamental Theorem of Arithmetic governs factorization of integers. 
Factorization of a polynomial is the key step to location of its roots. In this 
chapter, we consider the factorization of elements in various rings, obtaining 
a deeper insight into the structure of finite fields. 


8.1 Factorization in integral domains 


Integral domains follow the model of the ring Z of integers. Factorization 
in the ring of integers was studied in Chapter 1. However, the discussion 
there focussed mainly on positive integers. Now the group of units of Z is 
Z* = {+1}, and each nonzero integer n is related to the positive integer |n| 
by the equation |n| = u-n for a unit u of Z. When discussing factorization in 
general integral domains, consideration of units (invertible elements) is a key 
feature. In an integral domain D, a factorization 


a=b-e 


of an element a of D as a product of elements b and c of D is said to be proper 
if neither b nor c is a unit. 
The following observation is very useful. 


PROPOSITION 8.1 
Let d be a nonzero element of an integral domain D. Then for an element u 
of D, the equality dD = duD holds if and only if u is a unit. 


PROOF Each multiple of du is certainly a multiple of d, so the set duD 
is always a subset of dD. 

If u is a unit, say uv = 1 for v in D, then each multiple dx of d is the 
multiple duva of du, so dD is a subset of duD, and the equality dD = duD 
holds. 

Conversely, suppose dD = duD, so the element d1 of dD is some multiple 
duv of du. Cancellation (Proposition 7.10, page 159) in the equation dl = duv 
implies that 1 = uv, so u is a unit. (] 
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In Section 1.7, two properties of positive integers were introduced: 
e the internal property of being irreducible, and 
e the external property of being prime. 


Here are the corresponding properties in general integral domains: 


DEFINITION 8.2 (Irreducibles and primes.) Let D be an ID. 


(a) An element a of D is said to be irreducible in D if it is not zero, not a 
unit, and if it has no proper factorization a = b-c in D. 


(b) An element a of D is said to be prime in D if it is not zero, not a unit, 
and if 
a | be implies (a|b or alc) 


for elements b and c of D. 


REMARK 8.3 (a) For a unit u, an element a of an integral domain D is 
irreducible if and only if wa is irreducible (Exercise 2). 

(b) A nonzero, noninvertible element a of an integral domain D is reducible 
if it is not irreducible. A reducible element does have a proper factorization 
in D. 

(c) The clause “in D” is an important rider on the terms “irreducible” and 
“prime” in Definition 8.2. For example, even if a nonzero element a of an 
integral domain D is prime in D, it is not prime in the field of fractions Fp, 
since there it is invertible. Also, compare Example 7.35 (page 170). J 


Example 8.4 (Integers.) 

An integer n is irreducible in Z according to Definition 8.2(a) if and only 
if |n| is a prime positive integer. This condition, in turn, is equivalent to n 
being prime in the sense of Definition 8.2(b). 


Example 8.5 (Polynomials.) 

Let F’ be a field. Consider the integral domain FLX] of polynomials over F’, 
and an element f(X) of F[X]. The units in the ring F[X] are the nonzero 
constants, the polynomials of degree 0. Now f(X) is nonzero and not a unit 
if it is not a constant, so deg f(X) > 0. Thus f(X) is irreducible in FLX] by 
Definition 8.2(a) if and only if it is irreducible by Definition 7.34. 


For positive integers, the two properties: being irreducible, and being prime, 
turned out to be equivalent (Proposition 1.11, page 13), so the single term 
“prime” sufficed in that case. An example shows that in a general integral 
domain, the two concepts may differ. 


FACTORIZATION 187 


Example 8.6 (Irreducibles need not be prime.) 
Consider the commutative unital subring 


aa 


Y x 


x,y in 2} 


of the ring Z3 of 2 x 2 matrices over Z. Note that 
0 —3} |0 -3 3.0 
? Al fi id = E | = (3) 


so the matrix i ‘i serves aS a square root of —3, and each element of 


Z|V/—3] may be written in the form 


i | =rt+yVv—3 


Y x 


with integers x and y. Now 


det ° = = 27 + 3y? =0 
y 2 


if and only if «= y =0. Recalling 
det(AB) = det(A) det(B) 


for matrices A and B (Exercise 7 in Chapter 5), it is apparent that Z[V-3 | 
is an integral domain. Moreover, the determinants of elements of VAN, =3] 


are 
0, 1,3, 4,7, 9, 12, 13, 16,... (8.1) 


in increasing order, an element being invertible in Z[V—3] if and only if its 
determinant is 1. Now 


ial aoa lea loa: 2) 


ba] Lay) bo 


of 4/2 are irreducible in Z[V-3 ls since their respective determinants are 
all equal to 4, an integer which does not factor nontrivially into a product 
of members of the ordered list (8.1). Thus elements of Z[/—3]| may not 
have a unique factorization into products of irreducibles. Moreover, since the 
irreducible element 2/2 divides the product on the left-hand side of (8.2), but 
does not divide either of the two individual factors in that product, it is not 
a prime element of Z[/—3]. 


The factors 
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The following result clarifies the general relationship between the concepts 
of Definition 8.2 within an integral domain D: 


prime (external property) 
implies 
irreducible (internal property) 


PROPOSITION 8.7 (Primes are irreducible.) 
Let D be an integral domain. If an element a of D is prime in D, then it is 
also irreducible in D. 


PROOF Suppose that the prime element a factorizes as a = bc in D. Then 
certainly a | bc. Since a is prime, it follows that a divides at least one of b and 
c, say c = ua for some element u of D. Then lc = c = ua = ubc, so 1 = ub 
by cancellation (Proposition 7.10, page 159). It follows that 6 is a unit in D: 
the factorization a = bc is improper. J 


8.2 Noetherian domains 


The Fundamental Theorem of Arithmetic has two parts. The “existence 
part” of the theorem, Theorem 1.13 (page 15), shows that each integer n > 1 
may be factorized as some product of irreducibles. Then the “uniqueness part” 
of the theorem, Theorem 1.14 (page 15), shows that any such factorization 
is essentially unique. Noetherian domains, the topic of this section, form a 
general class of integral domains having an analogue of the “existence part” 
of the Fundamental Theorem of Arithmetic. (For a complete characterization 
of those integral domains in which factorizations exist, without necessarily 
existing uniquely, see Exercise 7.) 

Let R be aring. An ascending chain of ideals in R is a sequence 


Jo < Ji < ues In,“ In41 < vik (8.3) 


of ideals J, of R, for natural numbers n, such that each ideal J, has an 
embedding J, — Jn+i as a subset of its successor. (Compare Example 6.21, 
page 135.) The union 


J= (J In ={r|r lies in Jn for some n in N} (8.4) 
neN 


of the members of the sequence is again an ideal of R (Exercise 5). 
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DEFINITION 8.8 (ACC, Noetherian domains.) Let R be a ring. 


(a) The ring R is said to satisfy the ascending chain condition, or ACC, 
if for each ascending chain (8.3) of ideals J, of R, there is a natural 
number N such that Jm = Jn for allm> WN. 


(b) The ring R is said to be a Noetherian domain if it is an integral domain 
that satisfies the ascending chain condition. 


The ascending chain condition may be paraphrased as saying that there are 
no infinite ascending chains (8.3): All eventually stabilize at some point Jy. 


PROPOSITION 8.9 (PID implies ACC.) 
Each principal ideal domain D is Noetherian. 


PROOF Consider an ascending chain (8.3) of ideals J, in D. Since the 
union (8.4) is again an ideal of the principal ideal domain D, it may be written 
in the form 

J=aD 
as the set of multiples of a certain element a of D. By the definition (8.4) of 
J, there is an element Jy of the sequence (8.3) such that a lies in Jy. For 
each integer m > N, the ideal Jy is a subset of Ji, since the sequence (8.3) 
is ascending. But J, is a subset of J = aD. By the absorption property of 
the ideal Jy, the set aD is contained inside Jy, so Jm is a subset of Jy. Thus 
Jm = Jn for all m> N. 


PROPOSITION 8.10 (Factors in Noetherian domains.) 
Let D be a Noetherian domain. Then each nonzero, noninvertible element of 
D has an irreducible factor. 


PROOF Assume that the proposition is false, and that ag is a nonzero, 
noninvertible element of D that does not have an irreducible factor. As an 
induction hypothesis, suppose that a, is a nonzero, noninvertible element of 
D that does not have an irreducible factor. Since a, is not itself irreducible, 
it has a proper factorization an = @n41bn+1, in which the factor a,4+1 is 
a nonzero, nonunit element of D that does not have an irreducible factor. 
By induction, we obtain such elements a, for each natural number n, with 
AnD = an4ibn41D contained in an+1D. 

Now consider the ascending chain 


agD — a,D eS An DO anqyiDo... 


of ideals in D. By the ascending chain condition, there is a natural number N 
with an D = ani1bn41D = an4i1D. Proposition 8.1 then implies that by 41 
is a unit, contradicting the properness of the factorization ay = an410Nn 41. 
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PROPOSITION 8.11 (Factorization in Noetherian domains.) 
Let D be a Noetherian domain. Then each nonzero, noninvertible element of 
D is a product of irreducible factors. 


PROOF Let a = ao be a nonzero, noninvertible element of D. As an 
induction hypothesis, suppose that a; is a nonzero, noninvertible element of 
D. Tf a; is not irreducible, it has a proper factorization a; = pj41a;+1 with an 
irreducible factor pj+1, by Proposition 8.10. Thus 


ao = P11 


= Pip2a2 


= P1p2..-Pidi 
= P1p2---PiPi414i+1 


Now a;D = pj414;41D is contained in a;,,D for i = 0,1,..., and so on. 
Consider the ascending chain 


agoD + a,D « eae a;D < ai41.D Arar 


of ideals in D. Since D is Noetherian, the chain stabilizes as ajn_1;D = anD 
for some natural number n. Thus a, is an irreducible element p, of D, and a 
factorizes as @ = Pp, ... Dn. 


COROLLARY 8.12 (Factorization in principal ideal domains.) 
In a principal ideal domain D, each nonzero, noninvertible element is a 
product of irreducible factors. 


PROOF By Proposition 8.9, the principal ideal domain D is Noetherian. 
Proposition 8.11 then gives the desired result. l] 


8.3. Unique factorization domains 


The full conclusion of the Fundamental Theorem of Arithmetic, existence 
and uniqueness, is formalized in the concept of a unique factorization domain. 
In order to capture the correct level of uniqueness, two elements p and q of a 
general integral domain D are defined to be associates if there is a unit u from 
D* such that p = u-q. The relation of being associate is an equivalence relation 
(Exercise 10). By Remark 8.3, associates of irreducibles are irreducible. 
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DEFINITION 8.13 (UFD.) A ring D is a unique factorization domain, 
or UFD, if it ts an integral domain in which the following properties hold: 


(a) (Existence of factorizations.) Each nonzero, nonunit element a of D 
has a factorization 
@ = Pip2---Pm (8.5) 
as a (nonempty) product of irreducible elements pi, po,..-,Pm of D; 
(b) (Uniqueness of factorizations.) For each nonzero, nonunit element a 


of D, the factorization (8.5) is unique to within reordering and passage 
to associate irreducibles. In other words, if 


@ = P1P2---Pm = 192 -+-In (8.6) 
with irreducibles p,,p2,---;Pm)Q1;42;---,4n in D, then m = n, and 
there is a permutation m of {1,2,...,n} such that p; is an associate of 


Q(z) for each 1 <j <n. 


In considering the uniqueness part of Definition 8.18, it is useful to ex- 
tend the defining property Definition 8.2(b) of a prime element in an integral 
domain. 


PROPOSITION 8.14 (“Divide and conquer.” ) 
Let p be a prime element of an integral domain D. If p divides a product 
a,...Gn in D, then it divides one of the factors a; for 1 <i<n. 


PROOF By induction on n, the induction basis n = 2 being a direct 
application of Definition 8.2(b). Suppose the proposition holds for all products 
with less than n factors, for an integer n > 3. Then by Definition 8.2(b), 
p|a1..-@n—14y implies p | a1...a@,—1 or p | an. The second case immediately 
gives the desired result. In the first case, p | a; for some 1 < i < n by the 
induction hypothesis. 


PROPOSITION 8.15 (Irreducible = prime in a UFD.) 
Let D be an integral domain satisfying the property of Definition 8.13(a). 
Then the following two conditions are equivalent: 


(a) D is a unique factorization domain; 


(b) Each irreducible element of D is prime. 


PROOF (a) implies (b): Suppose that p, is an irreducible element of a 
unique factorization domain D. Suppose that a product bc in D is a multiple 
pid of p,;. Suppose that the elements b, c, and d have respective factorizations 


b=q.---r, C= Gr41 +++ Gn; and = d=p2...Pm 
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as products of irreducibles in D. The element 
a=pid=b-c 
of D then has the factorizations 


Qa = Pip2---Dm = W1---Or* Ur41---dn;, 


so p, is the associate of dp(1), SAY P1 = UGn(1) OT Imi) = Up for uv = 1. If 
m(1) <r, we have p, | 6, while if 7(1) > r, we have p, | c. 

(b) implies (a): We will prove, by induction on m, that the uniqueness 
property of Definition 8.13(b) holds for each nonzero, nonunit element a of D. 
The induction basis is the case m = 1 of (8.6). In this case, the factorization 
M192 ---Qn of the irreducible element p; must be trivial, son = 1 and p; = qq. 
As the induction hypothesis, suppose that the uniqueness property holds for 
all factorizations with less than m irreducible factors p;. Consider (8.6) with 
m > 1. By the assumption (b), the irreducible element p,,, of D is a prime 
divisor of q,...Gn. The “divide and conquer” Proposition 8.14 implies that 
Pm divides some factor g, with 1 <r <n. Since q, is irreducible, it can only 
be a multiple gq, = up of py, with a unit u. In particular, p,, is an associate 
of q,. Now 

P1-++-Pm-1Pm = Q1- ++ Ir-1UPm4r41Ur+2 +++ In 
implies 
Di 6 Diat = G1 00s pa (Ur Geo 3 On (8.7) 
by cancellation. Set 


di forl<i<r; 
q = ug@41 fori=r; 


G+ forr<i<n. 
The equality (8.7) reads as 
Pi-+-Pm—1 = Gi ---G—1 


with irreducibles qj,...,q/,_1. By the induction hypothesis, n — 1 = m—1, 
and there is a permutation 7’ of {1,...,m-—1} such that each p, for 1 <j <m 


is an associate of q/, (j)" Now define 
(9) for 1 <a’) <r; 
mj) =< a'(j)+1 forr<a(j)<m; 
r for j =m. 


For 1 <j < _m, the irreducible p; is an associate of Teg) which in turn is an 
associate of q,(;)- Also, the irreducible p,, is an associate of q- = de(m)- Thus 
the uniqueness property holds for factorizations with m irreducible factors p,;, 
as required to complete the inductive proof. 
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THEOREM 8.16 (Every PID is a UFD.) 
Let D be a principal ideal domain. 


(a) Each irreducible element p of D is prime. 


(b) The integral domain D is a unique factorization domain. 


PROOF (a): Suppose p | ab, say ab = pc with c in D. Consider the set 
J ={ap+ya| x,y in D} (8.8) 


of “D-linear combinations” of p and a. Now J is an ideal of D (Exercise 14). 
Since D is a principal ideal domain, J is just the set dD of multiples of some 
element d of D. As 1p+0a, the irreducible element p of D lies in J, and is thus 
expressible as a multiple p = dz of D. Since p is irreducible, the factorization 
p = dz is improper. 


(i) If dis a unit, then J = dD = D contains 1, say 1 = a1p+ ya. 
Now b = 1b = x1pb+ yiab = x1bp + yicp = (x1) + yic)p, so p 
divides 6 in this case. 

(ii) If z is a unit, J = dD = dzD = pD by Proposition 8.1, so p 
divides the element a = 0p + la of J in this case. 


(b): By Corollary 8.12, D satisfies the existence property of Definition 8.13(a). 
Using part (a) of the theorem, part (b) then follows from Proposition 8.15. (] 


8.4 Roots of polynomials 


Let F' bea field. By Theorem 7.27 (page 167), the ring F'|X] of polynomials 
over F' is a principal ideal domain. Theorem 8.16(b) then shows that F'|X] is 
a unique factorization domain. The remainder of the chapter will rely heavily 
(and implicitly) on this property of such rings FX]. Note that the group of 
units FX]* of the ring FLX] is the group F* of nonzero constant polynomials 
(elements of F’). In particular, each polynomial of positive degree is associate 
to a unique monic polynomial (Exercise 12). 

A field F' is said to be a subfield of a field E if F’ is a subset of FE, and the 
inclusion j : F< E is aring homomorphism. In this context, the field E is 
also described as an extension (field) of F. If there is a further field K with 
inclusions F — K < E that are ring homomorphisms, then K is described 
as an intermediate field between F and E. If aj, ..., Gy, are elements of E, 
then F(a1,...,@n) will denote the smallest intermediate field containing the 
set {a1,...,@n}. For example, C is an extension field of Q, with intermediate 
field R. Then Q(i) = {a + iy | x,y in Q}. 
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In its most basic application, the field C of complex numbers is used to 
furnish a root i = /—1 of the polynomial X? +1 from R[X]. The polynomial 
X? +1 then factorizes in C[X] as the product 


X? 41=(X +1)(X -1) 


of linear factors. The following result, known as Kronecker’s Theorem, shows 
that each nonconstant polynomial over a field has a root in some extension 
field. 


THEOREM 8.17 (Kronecker’s Theorem.) 
Let F be a field. If f(X) is a nonconstant polynomial in F(X], then f(X) 
has a root in some extension field E of F. 


PROOF As anonconstant polynomial, f(X) is nonzero and noninvertible 
in the Noetherian domain F'[X]. Proposition 8.10 shows that f(X) has an 
irreducible factor p(X). If p(X) has a root x in an extension field EF, then 
f(x) =0 in E. 

Let J be the ideal p(X) FX] of F[X]. Since p(X) is irreducible, the quotient 
ring E = F[X]/J is a field (Theorem 7.39, page 171). Let x be the element 
X+J of E. Then p(x) = p(X) + J = p(X) + p(X) FX] = p(X)F[X] = J,” 
so x is a root of p(X) in E. (] 


Example 8.18 (The square root of —1.) 

Kronecker’s Theorem builds R[X]/(X?+1)R[X] as the extension field E of R 
in which the real polynomial X?+1 has a root. As discussed in Example 7.29 
(page 168), this field E is isomorphic to the field C of complex numbers. 
Moreover, the root « = X + (X? + 1)R[X] of X? +1 in E maps to the 
complex number 7 under the isomorphism (7.16). 


The real polynomial 
F(X) = X4*4-2X741=(X74+17 (8.9) 


has no real roots. However, in the extension field C of R, it has 7 as a double 
root, since it factorizes as X44+ 2X? 41 = (X +i)?(X — i)? in C[X]. There 
is a way to recognize that the real polynomial f(X) has a repeated root in 
the extension field C, without leaving the field of real numbers. Consider the 
derivative 

f'(X) =4X2 44% =4X(X* +1), 


computed as usual in calculus. It is then apparent that f(X) and f’(X) share 
the common, nonconstant factor X? + 1 in R[X]. 

If F is an abstract field, there is no analytical method to differentiate a 
polynomial f(X) in FX]. However, one may define the derivative Df(X) in 
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F'|X] of a polynomial f(X) in F[X] using the formal rules 
Dane 


and 
D(af(X) + bg(X)) = aDf(X) + bDg(X) 


for positive integers n, for constants a, b in F’, and for polynomials f(X) and 
g(X) in FX]. The following properties are readily verified, just as in calculus 
(Exercise 22). 


PROPOSITION 8.19 (Rules for differentiation.) 
Let F be a field. 
(a) For a polynomial 
p(X) = prnX” + pn1X" 1) +--+ p1X + po 
in F[X], the derivative is given as 
Dp(X) = nppX”* + (n—1)pp1X" 7 +++ + pr. 
(b) The product rule holds: 


D(f(X)- 9(X)) = Df(X) - g(X) + f(X)- Dg(X) 


for polynomials f(X) and g(X) in F[X]. 


The formal derivative may then be used to spot when a polynomial will 
have repeated roots in some extension field. 


THEOREM 8.20 (Derivatives and repeated roots.) 
Let F be a field, and let f(X) be a nonconstant polynomial in F[X]. The 
following conditions are equivalent: 


(a) There is an extension field E of F in which f(X) has a repeated root. 


(b) The polynomials f(X) and Df(X) share a common factor of positive 
degree in the unique factorization domain F|X]. 


PROOF (a) implies (b): Suppose that there is a factorization 


f(X) = (X — a)’9(X) 


in the ring E(X) of polynomials over an extension field F of F. By the 
product rule, 


Df(X) = 2(X — a)g(X) + (X — a)’Dg(X). 
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Thus f(a) = 0 and Df(a) = 0. Let j : F — E be the inclusion of F in 
E. The Substitution Principle (Theorem 6.39, page 145) gives a unique ring 
homomorphism jq : FX] — E;h(X) + h(a) with ja(X) = a. The ring kernel 
Ker j, in the principal ideal domain F'[X] is the set p(X)F'|X] of multiples of 
a certain nonconstant polynomial p(X). Since f(a) = 0 and Df(a) = 0, the 
polynomial p(X) is a common factor of f(X) and DF(X) in F[X]. 

(b) implies (a): Suppose that p(X) is a nonconstant factor of both f(X) 
and Df(X) in FX]. By Kronecker’s Theorem, there is an extension field E 
of F in which p(X) has a root a, so f(a) = 0 = Df(a) in E. Then 


I(X) = (X —a)h(X) (8.10) 


in E[X], and 
Df (X) = h(X) + (X — a)Dh(X) 


by the product rule. Now Df(a) = 0 in E implies h(a) = 0 in E. From (8.10), 
it is apparent that f(X) has a as a repeated root in E. 


8.5 Splitting fields 


Let f(X) be a polynomial over a field F'. The polynomial f(X) is said 
to split over an extension field E of F if f(X) factorizes as a product of 
linear factors in E[X]. For example, the polynomial X+ + 2X? +4 1 of (8.9), 
considered as a polynomial over the field Q of rationals, splits over the field of 
complex numbers. The splitting of nonconstant polynomials is a consequence 
of Kronecker’s Theorem. 


PROPOSITION 8.21 (Every nonconstant polynomial splits.) 
Let f(X) be a nonconstant polynomial over a field F. Then f(X) splits over 
some extension field E of F. 


PROOF = As an induction hypothesis, assume that the proposition is true 
for all fields F and for all nonconstant polynomials of degree less than some 
positive integer n. Note that the proposition is trivial if deg f = 1. Suppose 
deg f=n>1. 

Case A: If f(X) has a root a in F, then it factorizes as (X — a)g(X) in 
F |X], with deg g = n—1. By induction, g(X) splits over some extension field 
E of F. Then so does f(X). 

Case B: If f(X) has no root in F, it has an irreducible factor p(X) in 
F'|X]. By Kronecker’s Theorem, there is an extension field K of F' such that 
p(X) has a root a in K. Then f(X), as a nonconstant polynomial of degree 
nin K[X], has a root a in K. Case A now applies, with F' replaced by K. 
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It follows that f(X) splits over some extension field E of K. Since F is a 
subfield of K, the field EF is an extension field of F’. 


The proof of Proposition 8.21 embodies a process for finding an extension 
field E over which a polynomial f(X) from FX] will split. Consider the 
example of f(X) = X*+2X?+1 in Q[X]. Since f(X) has no root in Q, Case 
B becomes relevant. Now f(X) has the irreducible factor X? + 1 in Q[X]. 
Kronecker’s Theorem builds 


Qli) = QX] /(X? + Q(X] = {x + ty | x,y in Q} (8.11) 
as an extension field of Q in which X? + 1 has a root, namely 
i= X+(X?+1)Q[X]. 


At this point, we are transferred to Case A, working with the polynomial f(X) 
as an element of Q(i)[X]. Factorization in Q(i) |X] yields f(X) = (X—i)g(X) 
with g(X) = (X +1)?(X — i). Since g(X) already splits over Q(i), this is the 
extension field over which f(X) will split, as f(X) = (X — i)?(X + i)?. 

Let F be a field. Let f(X) be a nonconstant polynomial in FX]. Suppose 
that f(X) splits over an extension field L of F, say as a product 


F(X) = (X —h)(X — by)... (X= In) 
of a nonzero constant c from F and (not necessarily distinct) linear factors 
(X —1;) with 1; in L. The smallest subfield F(1,...,ln) of Z containing F 


and the set {l,...,J,} of roots is called a splitting field of f(X) over F. In 
other words, a field K is a splitting field for a polynomial f(X) in FLX] if 


e f(X) =co(X —c)...(X — cp) in K[X] (with co,c1,...,¢n in K), and 
e f(X) doesn’t split over any proper intermediate field between F' and K. 


Note the dependence on the choice of coefficient field F' for the polynomial 
f(X), just as with the definition of irreducibility. 


Example 8.22 (Dependence of the splitting field.) 
With F = Q and f(X) = (X?+4+ 1)”, we obtain (8.11) as a splitting field. On 
the other hand, C is a splitting field for f(X) = (X?2 +1)? over F=R. (] 


Example 8.23 (Use of the quadratic formula.) 
Consider the problem of specifying a splitting field for the quadratic equation 
f(X) SX? -X +1 


over Q. Recall that for complex numbers a, b, and c, the quadratic polynomial 
aX? + bX +c has roots given by the formula 


—b+ Vb? — 4ac 
2a , 
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Applying that formula to f(X) in CX], we obtain the two roots $(1 + ivV3). 
These roots lie in the field Q(iV3) or QLX]/(X? + 3)Q[X], giving 


f(X) = (X- 5 (1 +4v3)) (X= 5(1—#v3)) 


as the factorization in Q(iV/3)[X]. Since iv3 is not rational, no smaller field 
will split f(X). Thus Q(iV3) is the splitting field. l 


Example 8.24 (Roots of unity.) 
Let n > 2 be an integer. To specify a splitting field for X” — 1 over Q, 
consider the complex number 


= [fit/n) ealann) [PS 


The powers w’, for 0 <r <n, are n roots of X” —1. They are known as roots 
of unity. Then 


X” —1=(X -1)(X —w)(X —w?)...(X —w™") 


in Q(w), and Q(w) is seen to be a splitting field for X" — 1 over Q. (] 


8.6 Uniqueness of splitting fields 


The proof of Proposition 8.21 provides a technique for obtaining a splitting 
field of a nonconstant polynomial f(X) in the ring of polynomials over a 
field F’. However, the technique involves some arbitrary choices: choice of a 
particular root a in Case A, or choice of a particular irreducible factor p(X) in 
Case B. Different choices at these points cause a branching in the construction 
process that could potentially lead to different splitting fields. In this section, 
it will be shown that the divergent paths eventually land back in the same 
place: Any two splitting fields for f(X) over F' are isomorphic. 

We begin with the case of an irreducible polynomial. 


PROPOSITION 8.25 

Let F be a field, and let p(X) be an irreducible polynomial over F. Suppose 
that a; is a root of p(X) in an extension field E; of F, fori =1,2. Then the 
fields F(a,) and F(ag) are each isomorphic to F[X]/p(X)F[X]. 


PROOF For i= 1,2, let 7‘: F — E; be the inclusion of F in the extension 
field E;. The Substitution Principle (Theorem 6.39, page 145) gives a unique 
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ring homomorphism jj, : F[X] > Ej;h(X) + h(a;) with j,(X) = a;. The 
ring kernel Ker 7}, in the principal ideal domain F'[X] is the set qi(X)F[X] of 
multiples of a certain nonconstant polynomial q;(X). Now p(a;) = 0 implies 
that the irreducible polynomial p(X) is a multiple of q;(X). Since p(X) is 
prime in the principal ideal domain F'[X], the polynomials p(X) and q;(X) are 
associates, so Ker ji, = qi(X)F[X] = p(X)F[X]. The image of 7, is F(a;), 
so the First Isomorphism Theorem for rings gives a well-defined isomorphism 


b; : F[X]/p(X)F[X] > F(a;); h(X) + p(X) F[X] © h(a;) (8.12) 
for i= 1,2. J 


A slight refinement of Proposition 8.25 is needed for the treatment of the 
general case. 


PROPOSITION 8.26 

Let F be a field, and let p(X) be an irreducible polynomial over F. Suppose 
that a, is a root of p(X) in an extension field of F. Let 0: F + G,;x+& be 
a field isomorphism, with corresponding ring homomorphism 


Ox : FLX] > GLX]; h(X) > A(X) 


given by the Substitution Principle. Let az be a root of p(X) in an extension 
field of G. Then there is a field isomorphism @ : F(a.) + G(az), restricting 
to0: F > G, with 0(a1) = ag. 


PROOF The ring homomorphism 6x : F[X] — G[X] is an isomorphism, 
with two-sided inverse G[X] > FX]; h(X) + h(X). Thus p(X) is irreducible 
in G[X]. By Proposition 8.25, there is an isomorphism 

by : F[X|/p(X)F[X] — F(a1); A(X) + (X)F[X] > h(an) 
as in (8.12), restricting to the identity map on F in the form 
Fo F;r4+p(X)F[X]H «a. 
Similarly, there is an isomorphism 
bs : G[X]/B(X)G[X] > Gaz); F(X) + B(X)G[X] + Bla2) 
as in (8.12), restricting to the identity map on G in the form 
G— G;F4+ p(X)G[X] =. 


Moreover, the First Isomorphism Theorem for rings, applied to 


F(X] > G[X]/p(X)G[X]; h(X) & h(X) + p(X)G[X], 
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yields a well-defined isomorphism 
b: F(X] /p(X)F[X] 
h(X) + p(X) F(X] 


G[X]/p(X)G[X]; 
A(X) + P(X)G[X]. 


=> 
i 


The desired isomorphism @ : F (a1) > G(ag) is then realized by by obo b; 1. 


0 
c+iwy it > xu—1y 
by be 
b 
xtyX + (X?+1)R[X] | ~ c+yX+(X?+1)R[X] 


FIGURE 8.1: Complex conjugation from Proposition 8.26. 


Example 8.27 (Complex conjugation.) 

For the irreducible polynomial p(X) = X? +1 in R[X], consider the root 
a, =i in the extension field C = R(z). Take 0: R — R to be the identity map 
idg on the reals. Take az to be the root —i of p(X) = X?+1. Then the field 
isomorphism 6 : R(i) + R(—i) with 6(i) = —i is the complex conjugation 

CHC z=¢#+iy%=axr-y 


(Figure 8.1). Compare Study Project 1 in Chapter 6. l] 


It will now be shown that splitting fields are unique up to isomorphism. 


PROPOSITION 8.28 
Let F be a field. Let f(X) be a nonconstant polynomial in F|X]. Let 


O:FS3Gj tre 
be a field isomorphism, with corresponding ring homomorphism 
Ox : F[X] > G[X]; A(X) & A(X) 


given by the Substitution Principle. Suppose that K is a splitting field for 


f(X) over F[X], and that L is a splitting field for f(X) over G. Then there 
is a field isomorphism 0: K — L extending @: F —G. 
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PROOF As an induction hypothesis, suppose that the proposition is true 
for all fields F' and for all nonconstant polynomials of degree less than some 
positive integer n. (Note that the proposition is trivial if deg f = 1, since 
K=F,L=G, and 6: K = L is just 6: F — G in that case.) Suppose 
deg f=n>1. 

Let p(X) be an irreducible factor of f(X). Let a, be a root of p(X) in 
kK, and let ag be a root of p(X) in L. By Proposition 8.26, there is a field 
isomorphism 6 : F(a,) > G(az) extending 0: F — G, with 6(a,) = ag. 
Within the polynomial ring F(a,)[X], the polynomial f(X) factorizes as 
f(X) = (X — ay)b(X). 

Since deg f > 1, the polynomial b(X) in F(a1) |X] is nonconstant, of degree 
n—1. Now K is a splitting field for b(X) over F(a,). The field isomorphism 


6: F(a.) > G(az) 
furnishes a ring isomorphism 
Ox : F(a1)[X] > G(a2)[X]; A(X) & A(X) 


by the Substitution Principle. Note that Ox extends 0x. Under the ring 
isomorphism 0x, the factorization f(X) = (X — a,)b(X) in F(a1) |X] maps 
to f(X) = (X—ay)b(X) in G(az)[X]. Thus L is a splitting field for 6(X) over 
G(az). By the induction hypothesis, there is a field isomorphism 0: K — L 
extending 6 : F(a,) > G(az). Since 6 : F(a1) + G(az) extends 6: F > G, 
it follows that 6: K — L extends 6: F — G. This completes the inductive 
proof. 


THEOREM 8.29 (Uniqueness of splitting fields.) 
Let F be a field. Let f(X) be a nonconstant polynomial in F|X]. Suppose that 
K and L are splitting fields for f(X) over F. Then K and L are isomorphic. 


PROOF In Proposition 8.28, take 6: F — G to be the identity map 
idp: F > Fon F. l] 


Example 8.30 (Splitting X” — a over the rationals.) 

Let n > 2 be an integer. Let a be a positive real number. Consider the 
problem of determining the splitting field for X” — a over Q. Since x x” 
is a strictly increasing, continuous function on the set of positive reals, there 
is a unique positive real number r with r” = a. This number is denoted by 
a/". The field Q(a!/”) is a subfield of R. In Q(a’/")[X], the polynomial 
X” —a has a linear factor X —a'/", but does not factorize further. However, 
in Q(a'/")(w), with w as in Example 8.24, the polynomial X” — a splits, with 
roots of the form a!/"w* for 0 < k <n. Thus the splitting field for X” — a 
over Q is Q(al/”,w). 
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8.7 Structure of finite fields 


Some finite fields made their appearance in Chapter 7. In the final two 
sections of this chapter, we undertake a more comprehensive study, starting 
with an observation about finite abelian groups. 


PROPOSITION 8.31 
Let A be a finite abelian group. Let x be an element of A whose order d, is 
maximal. Then for each element y of A, the order dy of y divides dy. 


PROOF If d, does not divide dz, then dy has a prime factor p which 
does not divide d,. Since gcd(p,d,) = 1, there are integers 1 and m with 
lp + md, = 1. 

Now z = y*v/? has order p. Consider the subgroup (xz) of A. It contains 


((ez)'?) = (a'P2!P) = (at ™4") = (a) 
as a subgroup of order d,, and 
{(a2) 4) = (mee gman) — (2t-lP) = (2) 


as a subgroup of order p. By Lagrange’s Theorem, the order of xz is a multiple 
of pd,. This contradicts the maximality of d,. 


PROPOSITION 8.32 (Multiplicative groups of finite fields.) 
Let K be a finite field. Then the group k* of nonzero elements of K is cyclic. 


PROOF Let e be an element of AK* of maximal (multiplicative) order d. 
By Proposition 8.31, each element of K* is a root of the polynomial X¢ — 1. 
By Proposition 7.24 (page 166), |A*| < d. On the other hand, d = |(e)| isa 
divisor of |K*|, by Lagrange’s Theorem. Thus K™ is the cyclic group (e). 


An element e of a finite field K is called a primitive element if K* = (e) 
— compare Study Project 3 in Chapter 7. Now if F is a subfield of a finite 
field EF, the additive group (F,+,0) of F is a subgroup of the additive group 
(E£,+,0) of E. Lagrange’s Theorem then shows that the order |F| of F isa 
divisor of the order |E| of E. In fact, a much stronger statement is true. 


PROPOSITION 8.33 (Additive groups of finite fields.) 

Let E be a finite field, with a subfield F. Then the additive group (E,+,0) of 
E is isomorphic to a power of the additive group (F,+,0) of F. In particular, 
the order |E| of E is a power of the order |F| of F. 
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PROOF Let e be a primitive element of E. Let j : F — E be the inclusion 
of F in E. The Substitution Principle (Theorem 6.39, page 145) gives a unique 
ring homomorphism j. : F[X] — E with j.(X) = e. The ring kernel of j. in 
the principal ideal domain FX] is the set p(X)F[X] of multiples of a certain 
polynomial p(X). If the degree of p(X) is m, Proposition 7.31 (page 169) 
shows that the additive group of F'X]/p(X)F'[X] is isomorphic to (F,+,0)™. 

Each nonzero element of E, as a power e” of e, appears as j-(X”) in the 
image of j.. Thus j-(F'|X]) = E. The First Isomorphism Theorem for Rings 

(Theorem 6.34, page 140) applied to 7. : F[X] — E yields the ring isomor- 
phism 

F(X|/p(X)FIX| SE. 


It follows that (F,+,0) is isomorphic to the power (F,+,0)™ of (F,+,0). (] 


PROPOSITION 8.34 (Prime subrings of finite fields.) 
Let K be a finite field. Then the prime subring of K is the field Z/, of 


integers modulo a prime number p. 


PROOF § Since K is finite, its prime subring is finite, and thus of the form 
Z/q for a positive integer d. If d is composite, say d = ab with 1 < a,b < d, 
then a-b=0 in K. This cannot happen in the integral domain Kk. 


COROLLARY 8.35 

The additive group (K,+,0) of the finite field K is isomorphic to a power 
(Z/p,+,0)” of the group (Z/p,+,0) of integers modulo p under addition. In 
particular, the order |K| is a power p” of the prime number p. 


PROOF Set E = K and F =Z/, in Proposition 8.33. l 


Proposition 8.34 shows that the characteristic of a finite field K is a prime 
number p. Now in a ring R of characteristic p, the Frobenius map 


yp: Ra Raw aP 


is aring homomorphism (Exercise 45 in Chapter 6). In an integral domain of 
characteristic p, the Frobenius map is injective (Exercise 35). 


PROPOSITION 8.36 (Subfields and the Frobenius map.) 
Let K be a field of characteristic p. Then for each natural number r, the 
subset 


L= {ax | p"(«) = x} 
of Kk forms a subfield of Kk. 
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PROOF The set L is nonempty, indeed y"(1) = 1. Now suppose that x 
and y are elements of L. Then 


g'(x-y) =9"(z)-¢'(y)=2-y 


and 

g(e-y=e'(a) ey) =a-y, 
so L is a unital subring of the integral domain Kk. As such, FL is an integral 
domain. Each element of L is a root of the polynomial X™ — X, with m = 
p", over the field Z/,. Proposition 7.24 (page 166) shows that the set L is 
finite. By Proposition 7.16 (page 162), it follows that the integral domain L 
is actually a subfield of Kk. 


8.8 Galois fields 


Corollary 8.35 shows that the only possible orders for a finite field are the 
powers of a prime number. The following theorem shows that for each power 
q = p” of a prime number p (with n > 1), there is a field of order g. To 
within isomorphism, this field is unique. It is known as the Galois field GF(q) 
of order q. In this notation, the field Z/, of integers modulo p is written as 
GF(p). 


THEOREM 8.37 (Classification of finite fields.) 
Let q= p” be a power of a prime number p, with positive index n. 


(a) There is a field GF(q) of order q. 
(b) Each field K of order q is isomorphic to GF(q). 


PROOF (a): Consider the splitting field E of the polynomial X% — X in 
Z/,[X]. The set of roots of X4— X in E is the subset 


GF(q) = {x | e"(2) = 2} 


of EF. By Proposition 8.36, GF(q) is a subfield of E. In fact, since X47 — X 
splits over GF(q), we have E = GF(q). Now the derivative of X4— X is 

gXt 1 -L=p"Xt!-1=-1, 
which does not have any factor of positive degree. Theorem 8.20 shows that 
X4— X, as a polynomial of degree qg, has no repeated roots in any extension 
field. Thus its set GF(q) of roots has exactly q elements. 
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(b): If K is a field of order g, Proposition 8.34 shows that K contains Z/, 
as a subfield. Since K* is a cyclic group of order q — 1, each nonzero element 
of K is a root of the polynomial X¢ — X in Z/,[X], while 0 is certainly a 
root. Thus K is a splitting field of the polynomial X? — X in Z/,[X]. As 
such, it is isomorphic to the splitting field GF(q) for X4— X over Z/,[X], by 
Theorem 8.29. U 


For a prime power gq = p”, the final question concerns the possible subfields 
of GF(q). By Proposition 8.33, the only possible orders for subfields are 
the powers p” in which r divides n. Conversely, for such a power m = p’, 
Proposition 8.36 shows that GF(q) does have a subfield 


L= {a | p"(«) = x} 


consisting of the roots of the polynomial X™ — X in GF(q). By Theorem 8.20, 
this polynomial has no repeated roots, so L is a subfield of order m = p’. 
Furthermore, it is the unique subfield of this order. Indeed, the elements of 
any subfield K of order p” would consist of roots of X™ — X, and would thus 
lie in LZ. We summarize as follows. 


PROPOSITION 8.38 (Subfields of finite fields.) 

Let q= p” be a power of a prime number p, with positive index n. The only 
possible orders of subfields of GF(q) are the powers p" for a divisor r of n. 
For each such power m = p", there is a unique subfield GF(m) of GF(q). 


For a prime number p, the subfields of GF(p") are displayed in Figure 8.2. 


Note that this figure is essentially just a relabelled version of Figure 1.3, which 
displayed the positive divisors of 72. 


GF(p°) —+ GE (p!®) — GF (p°) —> GE(p”) 


FIGURE 8.2: The subfields of GF(p”). 
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8.9 Exercises 


1. Let a be a nonzero element of a zero ring A (compare Example 6.6, page 
128). Show that a does not factorize as the product a = b-c of any pair 
of elements b and c of A. 


2. Let u be a unit of an integral domain D. Show that a nonzero, nonunit 
element a of D is irreducible if and only if ua is irreducible. 


3. Let A be a matrix in the ring Z[/—3] of Example 8.6. If det(A) is a 
prime number, show that A is irreducible. 


4. Give an example of an irreducible element A of Z[./—3] for which det (A) 
is a composite number. 


5. Show that the union (8.4) of the chain (8.3) is an ideal of the ring R. 


6. Consider the ring R® of all functions f : R — R, with componentwise 
unital ring structure (compare Example 6.12, page 130). For each nat- 
ural number n, define 


In ={f:R-R| f(x) =0ifa>n}. 


(a) Show that J, is an ideal of R® for each natural number n. 
(b) Show that 


Jo ; Ji om arene! In In+1 - 


is an ascending chain of ideals in R®. 


(c) Show that the ideal J, is a proper subset of the ideal J,,41 for each 
natural number n. 


(d) Conclude that R® does not satisfy the ascending chain condition. 


7. Let D be an integral domain. Show that the following two conditions 
are equivalent: 


(a) Each nonzero, noninvertible element a of D factorizes as a product 
of irreducibles in D; 


(b) D has no ascending chain 


agD a;Do... AyD An4iDO... 


of principal ideals with a,.D properly contained in a,+41D for each 
natural number n. 


8. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 
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A ring R is said to satisfy the descending chain condition, or DCC, if 
for each descending chain 


ae ee Jn—1 < 34a J1 < Jo=R 


of ideals J, of R, there is a natural number N such that J, = Jn for 
all m > N. Suppose that an integral domain D satisfies the descending 
chain condition. Show that D is a field. 


. Show that the integer 12 may be factored as a product of irreducibles 


in Z in 12 different ways (ordering the factors). 


Define a relation A on an integral domain D by setting x A y if and 
only if 7 = u-y for some u in D*. 
(a) Show that A is an equivalence relation on D. 


(b) Show that the group D* of units of D is the equivalence class of 1 
under the equivalence relation A. 


(c) Show that {0} is the equivalence class of 0 under the equivalence 
relation A. 


Show that (3.2) gives the associate classes in Z. 


Let F' be a field, and let p(X) be a nonzero, nonunit element of the 
integral domain FLX] of polynomials over F. Show that p(X) is the 
associate of a unique monic polynomial in FLX]. 


Let D be an integral domain, and let Id D be the set of ideals of D. 
Show that the relation of being associate is the kernel relation ker a of 
the function 

a: D—-IdD;araD. 


Verify that the set J of (8.8) is an ideal of the integral domain D. 


Show that the ring Z[/—3] of Example 8.6 is not a principal ideal 
domain. 


Show that each field F is a subfield of the field F'(X) of rational functions 
over F — compare Example 7.47, page 177. 


If & is an extension field of a field F', show that F(X) is an extension 
field of F(X). 


Find the inverse of a nonzero element x + iy of Q(i). 


Find the inverse of a nonzero element x + yv/5 of the field Q(V/5) = 
{a +yv5 | x,y in Qh. 
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20. 


21. 
22. 
23. 


24. 


25. 


26. 


27. 
28. 


29. 


30. 


3l. 
32. 
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Find a monic polynomial p(X) in Q[X] such that the quotient field 
Q[X]/p(X)Q[X] is isomorphic to the field Q ( 1+ V5). 

Show that Q(V2, V3) = Q(V2 + V3). 

Verify the claims of Proposition 8.19. 


Let f(X) = X*++ X?+1 in Z/2[X]. Show that f(X) is a solution of 
the differential equation Df(X) = 0. 


Let F be a field of characteristic zero. Show that the only solutions 
f(X) to the differential equation Df(X) = 0 in F[X] are the constant 
polynomials f(X) = c for cin F. 


Specify a splitting field for the quadratic equation 
f(X) =X? 4+ X41 

over Q. 

Specify a splitting field for the quadratic equation 
f(X) = X24 X41 

over Z/9. 

Specify the splitting field for X3 — 2 over Q. 


Find the smallest positive integer n for which the splitting field of the 
polynomial 
X* 44X42 


over Q is Q(/n). 
Determine the splitting field for 
X°42X? 44x +8 

over Q. 
Determine the splitting field for 

X* + 10X* + 100X? + 1000X + 10000 
over Q. (Hint: Set n = 5 and a = 10° in Example 8.30.) 
Show that the fields Q(/2) and Q(/3) are not isomorphic. 


Give an example of a finite group G with two elements x and y, such 
that 2 has maximal order dz in G, while the order d, of y is not a divisor 
of dy. Why does this not contradict Proposition 8.31? 
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33. In the proof of Proposition 8.33, show that the polynomial p(X) is 
irreducible in FLX]. 


34. (a) Show that there are no nontrivial finite subgroups in an infinite 
cyclic group. 
(b) Give an example of an infinite field K whose group K™* of nonzero 


elements is not cyclic. 


35. Let p be a prime number. 


(a) Let R be an integral domain of characteristic p. Show that the 
Frobenius map 
yp: RR xr a? 
is injective. 
(b) Let R be a finite field of characteristic p. Show that the Frobenius 
map 
yp: RR xr a? 
is surjective. 
(c) Show that a polynomial f(X) in Z/,[X] is the image y(g(X)) of 
some polynomial g(X) under the Frobenius map 
yp: Z/,|X] > Z/,[X] 


if and only if it is a solution of the differential equation Df (X) = 0. 


(d) Show that the field Z/,(X) of rational functions over the integral 
domain Z/, has characteristic p. 


(e) Show that the Frobenius map 
pi Zp(X) + Z/p(X) 
fails to be surjective. 


36. Consider the following ideals in the ring R = Z/2[X] of polynomials 
over Z/9: 
Jy = (XP +X 41)Z/2[X], 
Jog = (XP 4+ X72 4-1)Z/o[X], 
Jn = (X°4.X74X 4 1)Z/2X]. 


(a) Show that the quotient rings R/J, and R/J2 are isomorphic. 
(b) Show that the quotient rings R/J, and R/J3 are not isomorphic. 


37. Let F be a finite field, and let n be a positive integer. Show that there 
is an irreducible polynomial of degree n in FX]. 
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38. Let p be a prime number, and let m and n be positive integers. 


(a) Show that p™ — 1 is a factor of p”™’ — 1. (Hint: Use Lagrange’s 
Theorem, or Exercise 31 from Chapter 7.) 


(b) Let e be a primitive element of GF(p™”). Let r be the integer 


Show that the subfield GF(p™) of GF(p™”) consists of 0 and the 
powers of e”. 


39. Find the integer q such that GF(q) is the splitting field of the polynomial 
X44 X41 over Z/o. 


40. Let p(X) be an irreducible polynomial of degree 5 in Z/[X]. Let J 
be the ideal p(X)Z/2[X] of Z/2[X]. Explain why X + J is a primitive 
element of Z/2[X]/J. 

41. For 

cos Zn + isi 28 
w = cos — + isin — 
13 is 
show that Q(—w) is the splitting field of X2” — X over Q. 


42. What is the splitting field of X27 — X over GF(3)? 


8.10 Study projects 


1. The Sieve of Eratosthenes is a method for generating a partial listing 
of the irreducible elements in an integral domain whose associate classes 
may be ordered systematically. In its classical version, it produces a list 
of prime numbers, as illustrated in Figure 8.3. 


2/3/4/5/6]7]/8]9) 10/11) 12)13) 14/15] 16} 17) 18) 19 | 20) 21 | 22) 23 | 24) 25 
2/3} |5} |7] 19 11 13 15 17 19 21 23 25 
2/3] [5] |7 11 13 17 19 23 25 
2/3] [5] |7 11 13 17 19 23 


FIGURE 8.3: The Sieve of Eratosthenes. 
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The first row lists the integers bigger than 1 in increasing order. In the 
second row, all the multiples of 2, except for 2 itself, are deleted. The 
element to the right of 2 is 3, so at the next step all the proper multiples 
of 3 are deleted. Continuing in this fashion, the sieve leaves a list of 
the prime numbers. In Figure 8.3, the last row has just had the proper 
multiples of 5 deleted. It already exhibits the list of primes less than 25. 


The sieve may be used to produce a list of irreducible polynomials over 
Z/. For convenience during the calculation, a polynomial 


f(X) = anX” + an X"* +--+ +a, X + ag 


in Z/2[X] may be represented by the binary expansion d)@n_1...@1@9 of 
the integer obtained by interpreting f(X) in Z[X], and then computing 
f(2). For example, the polynomial X4 + X° +1 is represented as 


a* +23 4+1= 11001 
(i.e., the binary expansion of 25). The elements f(X) are then listed in 
increasing order of the corresponding integers f (2). 


Use the sieve to show that the following is a complete list of all the 
irreducible polynomials over Z/ 2 of degree less than 5: 


Degree l: X, X41; 

Degree2: X?7+X4+1; 

Destee 3° kX, Xe exes 

Deco 4d Ree eh, Kee eS ee ea 


. The ring Z[X] is not a principal ideal domain (compare Exercise 18 in 
Chapter 7). The goal of this project is to show that Z[X] is a unique 
factorization domain. In particular, the converse of Theorem 8.16(b) is 
false. Recall that Z[X] is an integral domain (Corollary 7.14, page 161). 


(a) Show that the group of units Z[X]* of ZX] is {+1}. 
(b) Show that each prime number is an irreducible element of Z[X]. 


(c) Define a polynomial 
P(X) = prX” + pnrX"* +++ + pr X + po 
of positive degree n in Z[X] to be primitive if py, > 0 and 
gcd(pn,---,po) =1. 


Show that each element f(X) of the ring Q[X] of polynomials over 
the rationals has a unique expression of the form 


F(X) = apps (X) 


with gy in Q and with p(X) as a primitive polynomial in Z[X]. 
The rational number gq; is known as the content of f(X). 
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(d) Show that ZLX] = {f(X) in Q[X] | qf in Z}. 

(e) For f(X) in Z[X], show that f(X) is primitive if and only if gf = 1. 

(f) Let p be a prime number. Let 0: Z — Z/,;n > n+ pZ denote 
reduction modulo p. Use the Substitution Principle to define a 
homomorphism 6x : Z[X] — Z/,|X]; f(X) > f(X) . Suppose that 
a polynomial f(X) in Z[X] has positive leading coefficient. Show 
that f(X) is a primitive polynomial if and only if f(X) is nonzero 
for each prime p. 

(g) Prove Gauss’ Lemma: the product of two primitive polynomials is 
primitive. 

(h) Let f(X) and p(X) be elements of Z[X], with p(X) primitive. If 
p(X) divides f(X) in Q[X], show that p(X) divides f(X) in Z[X]. 

(i) Let f(X) and g(X) be elements of Q[X]. If f(X) divides g(X) in 
Q[X], show that py(X) divides p,(X) in Z[X]. 

(j) Let f(X) and g(X) be elements of Z[X]. If f(X) and g(X) have 
a common nonconstant factor in Q[X], show that they also have a 
common nonconstant factor in Z[X]. 


(k) Show that if a nonconstant polynomial is irreducible in Z[X], it is 
also irreducible in Q[X]. 

(1) Let f(X) be an element of Z[X] with positive leading coefficient. 
Show that f(X) is irreducible in Z[X] if and only if it is a prime 
number, or is a primitive polynomial that is irreducible in Q[X]. 

(m) Show that each irreducible element of Z[X] is prime. Conclude 
that Z[X] is a unique factorization domain. 


. Eisenstein’s Criterion and cyclotomic polynomials. Section 7.6 


noted that it may be very tricky to decide whether a given polynomial is 
irreducible or not. One of the numerous tricks available is Eisenstein’s 
Criterion for the irreducibility over Q[X] of a polynomial f(X) from 
Z|X]. A polynomial 


F(X) = fa X" + frrX™ +--+ AX + fo (8.13) 
with integral coefficients fn, ..., fo satisfies the criterion if there is a 
prime number p for which 

Pt fn, DB Seis aa) P| fo, pt fo- (8.14) 


(a) Show that no constant integral polynomial can satisfy Eisenstein’s 
Criterion (8.14). 


(b) Suppose that an integral polynomial (8.13) satisfies Eisenstein’s 
Criterion (8.14). Suppose that there is a proper factorization 


F(X) = g(X)h(X) (8.15) 
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in Z[X]. Consider the corresponding factorization 
F(X) =G(X)A(X) (8.16) 


in Z/,|X] — compare (f) in Study Project 2. Show that (8.16) 
reduces to 7 so = 
F(X) = frX” =G(X)KX) 


in the unique factorization domain Z/,,[X]. Since 
X|g(X) and X | A(X) 


in Z/,[X], conclude that 


in Z/p or 
p|g(0) and p| A(0) 
in Z. Obtain a contradiction to the final condition of Eisenstein’s 
Criterion (8.14). 
(c) Using (k) in Study Project 2, show that an integral polynomial 
(8.13) satisfying Eisenstein’s Criterion is irreducible over Q[X]. 


(d) For a prime number p, the p-th cyclotomic polynomial is 


= XPT EXP? LEX 41 


— compare Exercise 31 in Chapter 7. Use Eisenstein’s Criterion, 
and Exercise 44 in Chapter 6, to show that ®,(1+.X) is irreducible 
over Q[X]. Conclude that ®,(X) is irreducible over QLX]. 


8.11 Notes 
Section 8.2 


A. “Emmy” Noether was a German mathematician who lived from 1882 to 
1935. 


Section 8.3 


Many treatments of factorization introduce the concept of a “Euclidean 
domain,” as an integral domain in which a version of the Division Algorithm 
is available. Elements of the domain are weighted, so that remainders have 
lesser weight than divisors. However, it appears that there is no complete 
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agreement between different sources about the multiplicativity properties that 
should be demanded of the weighting function. For this reason, we refrain 
from a discussion of Euclidean domains, merely noting the use of the Division 
Algorithm to show that Z, and rings of polynomials over fields, are principal 
ideal domains. 


Section 8.7 


Proposition 8.32 could have been derived from the structure theorem for 
finitely generated abelian groups (Theorem 9.56), while Proposition 8.33 may 
be proved by regarding EF as a finite-dimensional vector space over F' (compare 
Section 9.7). The proofs given here are designed to depend only on concepts 
that have already been encountered in the text. 


Section 8.8 


E. Galois was a French mathematician who lived from 1811 to 1832. 


Section 8.10 


Eratosthenes (Epatoodevns) was a Greek mathematician and geographer 
who lived from around 276 to 194 B.C. 

F.M.G. Eisenstein was a German mathematician who lived from 1823 to 
1852. 


Chapter 9 


MODULES 


In Chapter 6, rings were introduced axiomatically (Definition 6.1, page 127), 
based on the examples of the integers Z, the reals R, and the ring of 2 x 2 
real matrices. On the other hand, the axiomatic definition of an abstract 
group (Definition 4.14, page 71) was founded on the general class of concrete 
groups of permutations, while Cayley’s Theorem (Section 5.6) showed that 
each abstract group is isomorphic to a group of permutations. In this chapter, 
we investigate the corresponding general class of concrete rings, namely rings 
of endomorphisms (of abelian groups), and the accompanying concept of a 
module. For fields, modules are just vector spaces. For the ring of integers, 
modules are just abelian groups. Thus modules capture the features that are 
common to vector spaces and abelian groups, providing a general context for 
the pervasive phenomenon of linearity. 


9.1 Endomorphisms 


Suppose that (A,+,0) is an abelian group. An endomorphism 6 of Aisa 
group homomorphism @: A — A from A to A. (The prefix “endo-” suggests 
that after leaving the domain A, the homomorphism 6 goes back in to A 
again, as a codomain.) Endomorphisms are not required to be injective or 
surjective. In particular, the constant map 


04: A> Ajar0 (9.1) 


sending each element of A to 0 is an endomorphism of A, known as the zero 
map. Another endomorphism of A is the identity map id, on A, often written 
as 

14: A— Ajsarva (9.2) 


using the symbol 1, in parallel with the use of 0 in (9.1). 


Example 9.1 (Endomorphisms of integers modulo 2.) 
Consider the abelian group (Z/2,+,0) of integers modulo 2 under addition. 
The only endomorphisms of (Z/2,+,0) are the zero map (9.1) and the identity 
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map (9.2). Indeed, an endomorphism 6 of Z/2 is the zero map if 0(1) = 0 
and is the identity map if @(1) = 1. 


d 


Example 9.2 (Endomorphisms of the trivial group.) 
Consider the trivial abelian group {0}. The only endomorphism of {0} is the 
zero map (9.1), which coincides with the identity map (9.2) in this case. {J 


Example 9.3 (Multiples.) 
Let (A,+,0) be an abelian group. For each integer n, the map 
lin: A> Asare na, (9.3) 


sending a group element a to its multiple na, is an endomorphism of A. Taking 
the integer n = 0 gives the zero map (9.1). Taking the integer n = 1 gives the 
identity map (9.2). 


Example 9.4 (Left multiplication in rings.) 
Let (R,+,-) be a ring, with additive group (R,+,0). Let r be an element of 
R. As discussed in Section 6.2, the left multiplication 


R-Ravr-s« (9.4) 
by r is an endomorphism of (R,+,0). 
(a) For r = 0, the left multiplication (9.4) is the zero map (9.1). 


(b) If R is unital, and r = 1, then the left multiplication (9.4) is the identity 
map (9.2). 


Example 9.5 (Scalar multiplication.) 
Consider the abelian group R} of 2-dimensional real column vectors with 
componentwise addition. For each real number 4, the scalar multiplication 


A:RE OR 2] = bel 
2 


Axe 
is an endomorphism of R3. J 


Given two real scalars \ and yz in Example 9.5, we have the equality 
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for vectors 


in R}. More generally, the difference 0 — y of two endomorphisms 6 and ¢ of 
an abelian group A is given by 


(0 — v)(a) = O(a) — v(a) (9.5) 


for each element a of A. 


PROPOSITION 9.6 (The difference of two endomorphisms. ) 
Let A be an abelian group. Then the difference (9.5) of two endomorphisms 
0 and ~ of A is again an endomorphism of A. 


PROOF For elements a and b of A, we have 


(@— y)(at+ b) = O(a + db) — yp(a +b) 
= O(a) + 0(b) — vla) — v(0) 
= O(a) — pla) + 6(0) — v(0) 
= (4 — y)(a) + (6 — 9) (0). 


The first and last equalities use the definition (9.5) of the difference. The 
second equality holds since 6 and y are group homomorphisms. The third 
equality holds since A is abelian. 


COROLLARY 9.7 (The additive group of endomorphisms.) 
Consider the group AA of all maps from A to (A,+,0), equipped with the 
componentwise abelian group structure (A4,+,0) given by Definition 4.34(c). 


Then the set of all endomorphisms of the abelian group A forms a subgroup 
of AA. 


PROOF The set of endomorphisms of the abelian group A is nonempty, 
since it contains the zero endomorphism (9.1). Proposition 9.6 shows that 
the set of endomorphisms satsifies the closure property required by Proposi- 
tion 4.43 (page 80) for a subgroup — compare Remark 4.44. 


DEFINITION 9.8 (The endomorphism group End A.) The set of 
endomorphisms of an abelian group (A,+,0) is denoted by End(A,+,0), or 
just End A. As a group with the structure given by Corollary 9.7, it is known 
as the endomorphism group (End A,+,0) of the abelian group A. 


Along with its closure under the subtraction (9.5), the endomorphism set 
End A of an abelian group A is also closed under functional composition. 
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Indeed, if 6 and y are endomorphisms of A, then so is 6 o y — compare 
Exercise 4 in Chapter 5. The full structure is described as follows. 


THEOREM 9.9 (The endomorphism ring.) 


Let (A,+,0) be an abelian group. Then the set End A of endomorphisms of A 
forms a unital ring (End A,+,0). The additive group structure (End A, +, 0) 
follows Definition 9.8: 


(0+ v)(a) = O(a) + vla) 


for 0, yp inEndA anda in A. The monoid structure (End A,o,1) is given by 
functional composition: 


(8.0 y)(a) = (y(a)) 


for 6, p in End A anda in A. The zero element 0 is the zero map (9.1), while 
the unit element is the identity map (9.2). 


PROOF In order to obtain (End A,+,0) as a ring, the distributive laws 
must be verified. Consider endomorphisms 6, y, and w of A. Then for each 
element a of A, we have 


90 (p+)(a) = O(p(a) + ¥(a)) 
=00 (a) +00 y(a) 
= (009+ 0¥)(a) 


and 


(p+ W) 0 0(a) = (p + 4) (0(a)) 
= 90 A(a) + 2 0 0(a) 
= (pod +po8)(a). 


Thus the left distributive law 00(y+w) =00p+6oy and right distributive 
law (p+ v)00=p004+ W098 hold in End A, as required. 


For an abelian group A, the ring End A or (End A,+,0) of Theorem 9.9 is 
known as the endomorphism ring of A. 


Example 9.10 (Groups of order 2.) 


Let A be an abelian group of order 2. Then the endomorphism ring End A 
is isomorphic to the ring of integers modulo 2 (Exercise 7). (] 
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9.2 Representing a ring 


Cayley’s Theorem shows that an abstract group (G,-,1) is isomorphic to a 
subgroup of the symmetric group G! on the underlying set G. Here are the 
analogous results for rings. 


THEOREM 9.11 
Let (R,+,-) be a ring, with additive group (R,+,0). For each element r of 
R, consider the left multiplication 


Ar: RO Rae r-s 
by r. Then there is a ring homomorphism 
A: R— End(R,+,0);r A; (9.6) 


from R to the endomorphism ring of the additive group (R,+,0) of R. 


PROOF For elements r, s, and x of R, we have 
(Ap + As)(@) = Ap(@) +As(@) =r e+s-c=(r+s)-@ = Ap4s(z) 
by the left distributive law and 
Apo As(4) = 7r- (8-2) =(r-s)- 4 = rp5(x) 


by the associative law, so that A is a ring homomorphism. l] 


COROLLARY 9.12 (Cayley’s Theorem for unital rings.) 
Let (R,+,-) be a unital ring. Then the map 


A:R-— End(R,+,0) 


of (9.6) is an injective homomorphism of unital rings. 


PROOF For elements r and s of R, the equation A, = A, implies 


r=r-1l=4,(1)=A,(1)=s-l=s, 


so the map 
A:R— End Rr A, 


is injective. Moreover, A; is the identity map idr, so that A is a unital ring 
homomorphism. 
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Example 9.13 
If (R,+,-) is not unital, it may happen that the ring homomorphism 


A:R— End(R,+4,-);7 + A, 


of (9.6) is not injective. Consider the subring R = {0,2} of the ring of 
integers modulo 4. Since 2.2 = 0 = 2-0, the left multiplications Az and Ag 
coincide. In this case, the image of (9.6) is trivial, consisting only of the zero 
endomorphism. 


9.3. Modules 


Let R be a unital ring. According to Corollary 9.12, there is a unital ring 
homomorphism 
R— End(R,+,0) 


from R to the endomorphism ring End(R, +, 0) of the additive group (R, +, 0) 
of R. In this case the abelian group (R,+,0) is directly obtained as part of 
the ring structure on R. However, we may just as well consider a unital ring 
homomorphism 

R— End(A, +, 0) 


from R to the endomorphism ring End(A,+,0) of an arbitrary abelian group 
(A,+,0). 


DEFINITION 9.14 (Modules with explicit structure map.) Let R 
be a unital ring, and let (A,+,0) be an abelian group. Suppose that there is a 
unital ring homomorphism 


a0: R— End(A,+,0);r- o, (9.7) 
from R to the endomorphism ring End A of the group A. 


(a) The abelian group A is described as a (unital) (left) module over R, or 
a (unital) (left) R-module (A,+,0¢), with structure map (9.7). 


(b) For each element r of R, the endomorphism 
op: A> A;ata,(a) 


is described as the action of the ring element r on A. 


REMARK 9.15 Since oj = id, in (9.7), we have 
o1(a) =a 


for each element a of A. (] 
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Example 9.16 (Scalar action.) 
Let F be a field. Consider the abelian group Fi} of 2-dimensional column 


vectors 
T1 
v2 
with componentwise structure. For each element 4 of F’, consider the action 
. pl 1, | %1 At 
0,: Fs > F;; ia = | 


of \ on F} by scalar multiplication (compare Example 9.5 for the real case). 
Then F} becomes a module (F},+,0) over F' with structure map 


o:F > End Fi;A4 oy 
(Exercise 8). (] 


Example 9.17 (Conjugated action.) 
Consider the field C of complex numbers. For each element » of C, define 


the action = 
alt 1, | 71 AL} 
ous Ca ai ie ai | 


of \ on Cd by scalar multiplication with the complex conjugate \ of A. Then 
C3 becomes a module (C4, +,6) over C with structure map 


6:C — EndCj; A Oy 
(Exercise 9). (] 


Example 9.18 (Abelian groups as Z-modules.) 
Let (A,+,0) be an abelian group. Example 9.3 noted that for each integer 
n, the map pf, : A > A;a+ na, sending an element a of A to the multiple 
na, is an endomorphism of A. Now consider the map 


yw: Z— End(A,+,0);n py. (9.8) 
Since 
pm-n(@) = (+ n)a = mat na = ftm(a) + pina) 
by the Law of Exponents (5.13), and 


Lmn(a) = (mn)a = m(na) = fm © bn (a) 


for elements a of A and integers m, n, with f1(a) = a = idy(a), the map 
(9.8) gives a unital ring homomorphism. Thus the abelian group (A, +, 0) is 
a left Z-module (A, +, 4). In fact, (9.8) represents the only possible choice for 
a structure map making A into a Z-module (Exercise 10). 
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Definition 9.14 specifies modules using the structure map (9.7) as a unital 
ring homomorphism. However, it is often easier to recognize the presence of 
a module directly. 


PROPOSITION 9.19 (Characterizing modules.) 
Let R be a unital ring, and let (A,+,0) be an abelian group. Then A is a left 
R-module if and only if there is a map 


op: A A;aro,(a), 
for each element r of R, such that the following properties are satisfied: 
a) o-(a+b) =a,(a) +,(b); 
b) o-+s(a) = o,(a) + 0;(a) ; 


) 
) 

(c) ors(a) = 0, 004(a); 
) 


forr, sin R anda, bin A. 


PROOF First, suppose that A is a left R-module, by virtue of a unital 
ring homomorphism (9.7). Then (a) holds since ¢, is an endomorphism of 
(A,+,0), while (b) and (c) hold since o is a ring homomorphism, and (d) 
holds since the ring homomorphism ¢ is unital (Remark 9.15). 

Conversely, suppose that the conditions (a) — (d) are satisfied. By (a), each 
map o,;, for r in R, is an endomorphism of (A,+,0). Thus a function 


a0: R— End(A,+,0);7r 6 oa; 


is specified. By (b) and (c), the function o is a ring homomorphism. By (d), 
the ring homomorphism ¢ is unital. 


Example 9.20 (Rings as modules over subrings.) 

Let R be a unital subring of a unital ring A. For each element r of R, and 
for each element a of A, define o,(a) = ra. Condition (a) of Proposition 9.19 
is satisfied by the left distributive law in the ring A. Condition (b) follows 
from the right distributive law. Condition (c) follows by the associative law, 
and (d) holds since A is unital. Thus A is a left R-module, denoted by ;A. 


Consider the scalar action of Example 9.16 for the field F = C of complex 
numbers, and contrast it with the conjugated action of Example 9.17. In each 
of these two cases, the abelian group C} of 2-dimensional complex column 
vectors becomes a module over C. Example 9.16 yields the module (C3, +,<), 
while Example 9.17 yields the module (C},+,5). When comparing the two 
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examples, it is critical to distinguish the structure maps. However, in most 
cases there is only one structure map in question, and there is no need to keep 
mentioning it explicitly all the time. Proposition 9.19 allows us to simplify 
the notation for an R-module. Instead of writing o,(a), we just write r-a or 
ra for a ring element r and module element a. The conditions (a) through (d) 
of Proposition 9.19 become the axioms (a) through (d) of the following defi- 
nition. Proposition 9.19 is then interpreted as saying that the new definition 
is consistent with the previous Definition 9.14. 


DEFINITION 9.21 (Modules with implicit structure map.) Let R 
be a unital ring, and let (A,+,0) be an abelian group. Suppose that for each 
element r of R, and for each element a of A, an element r-a or ra of A is 
defined. Suppose that the following properties are satisfied: 


(a) r-(a+b)=r-atrb; 


(b) (r+5)-aar-ate-a; 
(©) (rs)-a=4- (8-4); 
(d) l-a=a 


forr, sin Randa, bin A. Then the abelian group A is described as a (unital) 
(left) module over R, or a (unital) (left) R-module (A,+, R). 


9.4 Submodules 


Let R be a unital ring. Modules over the ring R are abstract algebras 
in their own right, just like groups, rings, or any of the other algebras we 
have been studying. Thus there is a concept of a submodule, analogous to 
subgroups or subrings. We use the notation of Definition 9.21. 


DEFINITION 9.22 (Submodules.) Let R be a unital ring. Let (A, +, R) 
be a left R-module. Then a subset B of A is said to be an R-submodule of A, 
or just a submodule of M, if B is a subgroup of the abelian group (A,+,0), 
and r-+b lies in B for each r in R and b in B. 


Using Remark 4.44 (page 80), we see that a subset B of an R-module 
(A,+, R) is a submodule if: 
e B is nonempty; 
e «—y lies in B if x, y lie in B; 


e r-ciliesin B forrin Rand z in B. 
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Example 9.23 (Ideals as submodules.) 

Let R be a unital subring of a unital ring A. According to Example 9.20, A 
becomes a left R-module A, with the action r-a defined by the multiplication 
- in the ring (A,+,-) for r in R and a in A. Each ideal J of the ring A forms 
an R-submodule of pA, since J is a subgroup of (A,+,0), while r- 7 lies in J 
(for r in R and j in J) by the absorptive property (6.16) of the ideal J. (] 


In the theory of modules, one of the most fundamental concepts is the 
notion of generating a submodule. In order to understand this concept, we 
begin with a simple proposition. (Compare Exercise 30 in Chapter 4.) 


PROPOSITION 9.24 (The intersection of submodules.) 
Let (A,+,R) be a module over a ring R. Suppose that I is a set, and that 
for each element i of I, there is a submodule B; of A. Then the intersection 


() Bi = {a | a lies in B; for all é in T} (9.9) 
tel 


is again a submodule of A. 


PROOF Write B for the intersection (9.9). 


e B is nonempty, since each submodule B; contains 0. 
e If a, and ag lie in each B;, so does a; — ag. 


e If a lies in each B;, so does r- a for each element r of R. 


DEFINITION 9.25 Let X be a subset of a module A over a ring R. 


(a) The submodule RX generated or spanned by X is the intersection of 
all the submodules of A that contain X. 


(b) An R-linear combination of elements of X is an element of A of the 
form 

ry typ tre tot... +7Tn-2n (9.10) 

for a natural number n, elements x1,---X%p of X, and elements r1,...Tn 

of R. The elements 11,...T of R are known as the coefficients of the 


linear combination. Ifn = 0, the linear combination (9.10) is understood 
to be the zero element 0 of A. 


Rephrasing Definition 9.25(a), the submodule RX generated by a subset 
X is the smallest submodule containing X. The following result provides an 
explicit description of the submodule generated by a subset of a module, in 
terms of linear combinations. 
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THEOREM 9.26 (The submodule generated by a subset.) 
Let X be a subset of a unital module (A,+,R) over a ring R. Then the 
submodule RX generated by X is the set 


{ri-apt...+7n +2, | nin N,r; in R,x; in X} (9.11) 


of all R-linear combinations of elements of X. 


PROOF Write L for the set (9.11) of R-linear combinations of elements 
of X. We must show that L = RX. 


e Each submodule B of A that contains X also contains each R-linear 
combination 
Tye Ly Pass bi Qs LH 


of elements 71, ..., Zn of X. (A formal proof proceeds by induction 
on the natural number n, using the closure properties for submodules 
given in Definition 9.22 — see Exercise 16.) Thus each such submodule 
B contains L. It follows that their intersection RX also contains L. 


e Conversely, we must show that L contains RX. Now since A is unital, 
each element x of X appears in L as the R-linear combination 1 - z. 
Thus LZ contains X. Furthermore, L is a submodule of A: 


— Lis nonempty, since it contains 0 as the linear combination (9.10) 
with n=0; 


— If L contains 
Cary 07 tee PT Bn 
and 
/ / / / / 
BHT Uy +... Fly Ly, 


then it contains x — vz’ as 


/ 


ree ayt... ttn Unt (H-ri) apt... + (Hr) + ahs 


— If LZ contains 
L771: Cit... +n: Ln, 


then it contains r- «x (for each r in R) as 
(rry) apt... + (1rTn) Xn. 


Since L is a submodule of A that contains X, it contains RX. 
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COROLLARY 9.27 (Closure under linear combinations.) 
A subset B of a module (A,+,R) over a ring R is a submodule of A if and 
only if each linear combination 


T12eyp Ese ee ET CH 


(with n in N, elements r1,..., 7% in R and x},..., fp in B) lies in B. 
PROOF Set X = B in Theorem 9.26. [| 


Example 9.28 

Let R be a unital ring, and let A be the ring R[Y] of polynomials over the ring 
R in an indeterminate Y. Taking R as the subring of constant polynomials, 
consider A as an R-module pA or (A,+, R) according to Example 9.20. Then 
for each natural number n, the R-submodule of A generated by the set 


PEE ie, A 
is the set of all polynomials 
oY Pye eee EG (9.12) 


of degree at most n. Note that in this example, the coefficients r; of (9.12) 
as a linear combination — in the sense of Definition 9.25(b) — are just the 
coefficients of (9.12) as a polynomial. 


Attention will soon focus on modules that are generated by a finite subset. 


DEFINITION 9.29 (Finitely generated modules.) A module (A,+, R) 
over a ring R is said to be finitely generated if there is a finite subset X of 
A such that A is generated by X. 


Example 9.30 (Column vectors.) 
For a field F, the F-module F} of 2-dimensional column vectors (compare 
Example 9.16) is finitely generated, say by the subset 


1 0 
O} {1} f- 
If the field F is infinite, note that Fj itself is an infinite set. [ 
Any finite module (A,+, R) is finitely generated (by its underlying set A). 


On the other hand, the module A defined in Example 9.28 is not finitely 
generated — see Exercise 19. 
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9.5 Direct sums 


Let R be a unital ring. We continue with the theme that R-modules form 
another kind of abstract algebra, analogous to groups and rings. Accordingly, 
given two R-modules (A,+,R) and (B,+,R), a function f : A > B is defined 
to be an R-module homomorphism (or just a module homomorphism) if 


fl@+y) = f(a) + fy) (9.13) 


and 
f(r-x)=r- f(z) (9.14) 

for rin R and x, y in A. The image f(A) of an R-module homomorphism 
F:A-— Bisa submodule of B (Exercise 20). By (9.13), each R-module 
homomorphism 

f : (Ales R) rane (B,+, R) 
is an abelian group homomorphism 

f :(4,+,0) — (B,4+,0). 


If R is the ring Z of integers, then (9.13) implies (9.14) — Exercise 21. 

For a unital ring R, bijective R-module homomorphisms are described as 
R-isomorphisms, or just isomorphisms if the context is clear. Two R-modules 
A and B are said to be isomorphic, a relation that is written as A & B, if 
there is an R-module isomorphism b : X — Y. In particular, two Z-modules 
A and B are isomorphic if and only if they are isomorphic as abelian groups. 

For groups and rings, the idea of imposing componentwise structure was 
used to define direct products. For R-modules, the same idea may also be 
applied. However, since there are special properties that hold in the case of 
modules, different terminology is used. (Compare Exercises 24 and 25.) 


DEFINITION 9.31 (Direct sums.) Let R be a unital ring. Let n be a 
positive integer. Let A; be an R-module, for 1 <i<n. Then the (external) 
direct sum A, ®---@ Ap or Di<i<n A; is the set 


{(a1,...,@n) | a; in A; for 1 <i<n} 
equipped with the componentwise abelian group structure 
(a1,...,@n) +(a},..-,@,) =(a1t+a),,...,a, +/,) 
and componentwise actions 
r+ (@1,---,@n) =("-Q4,...,7* Gn) 


for r in R. 
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Example 9.32 (Direct products of abelian groups.) 


If Ay, ..., Ay are abelian groups, then the direct sum 
Ai ®--+® An 
of the Z-modules Aj, ..., Ay is just the direct product 
A, X--+X Ap 
of the abelian groups Ai, ..., An (compare Section 4.4). We thus speak of a 


direct sum of abelian groups, as a synonym for the direct product. 


Example 9.33 (Column vectors.) 

Let F be a field. Consider F' as a left F-module rF with the structure map 
given by the ring multiplication, as in Example 9.20. Recall the F-module 
F} of 2-dimensional column vectors with scalar action, as in Example 9.16. 
Then the map 


PF © pF > Fy; (21,22) 6 | 
is an F-module isomorphism (Exercise 22). J 


The following result characterizes modules (such as F} in Example 9.33) 
that are isomorphic to direct sums. 


THEOREM 9.34 (Recognizing direct sums.) 
Let B be a unital module over a unital ring R. Let n be a positive integer. 
Then the following conditions on B are equivalent: 


(a) B is isomorphic to a direct sum 
Ai @AL®...BAn 
of modules Ay, ..., An; 


(b) There are submodules B,, ..., By of B such that each element b of B 
has a unique expression as a sum of the form 


b= bi +bo+...+dn (9.15) 
with b; in B; for 1 <i<n; 


(c) There are submodules Bi, ..., By, of B such that B is isomorphic to 
the direct sum By ®@ Bo @®...B By. 
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PROOF (c) implies (a): This is immediate. 
(a) implies (b): For 1 <i <n, let A; be the submodule 


slot i 
Ae = {(0,.-.40, “Gi, 0,.--50) | a; in Aj} 


of A, ®...@ Ap. Each element a = (a1,...,a@,) of A, ®...@ Ap has a unique 
expression (@1,...,@n,) = (a1,0...,0) +...+(0,...,0,a,) as a sum of the 
form 


a=@,+...+Gy (9.16) 


with a = (0,...,0,a;,0,...,0) in A; for 1 < i < n. Consider the given 
isomorphism as 


B:AL®...@An > B. (9.17) 


For 1 <i <n, define the submodule B; of B to be the image B(Ai) of the 
submodule A; of A; @...@ Ay. Since (9.17) is an isomorphism, each element 
of B has a unique expression of the form ((a) for a in A, @...@ Ay. Applying 
the isomorphism (3 to the unique expression (9.16) for a as a sum of respective 
elements G1, ..., @ of A,,..., Ay we obtain the unique expression 


B(a) = B@1) +... + BGn) 


of B(a) as a sum of elements of By, ..., Bn. 
(b) implies (c): If (b) holds, there is a well-defined isomorphism 


B Bi, ®B.®...8 Babi + bg +... + by > (b1, b2,..., Bn) 


of R-modules. (] 


COROLLARY 9.35 

If B is isomorphic to the direct sum A, ®...® An of modules Ay, ..., An, 
then the submodules B; of B, for which B is isomorphic to the direct sum 
By, ®...® Bn, may be chosen so that A; = B; for 1 <i<n. 


PROOF For 1 <i <n, define a; : Aj > Aj;a; + (0,...,0,a;,0,...,0). 
Then the desired isomorphism is 30 a; : A; — B,. (| 


DEFINITION 9.36 (Internal direct sums.) A module B over a ring 

R is described as an internal direct sum (of submodules Bi, ..., By) if tt 
satisfies the equivalent conditions of Theorem 9.84(b),(c) for some positive 
integer n. 


230 Introduction to Abstract Algebra 


Example 9.37 (Column vectors.) 
Let F be a field. Consider the F-module F} of column vectors discussed in 
Example 9.33. The module F} is the internal direct sum of the submodules 


a= {] [s} 


and 
0 . 
Bog = cinFk 
x 
(Exercise 23). l] 
Suppose that B,,..., B, are submodules of a module B over a unital ring 
R. The condition for B to be an internal direct sum of By, ..., By is the 


existence and uniqueness of the expression (9.15) for elements 6 of B. It is 
helpful to consider the existence and uniqueness questions separately. Write 


By, +Bot+...+Bn (9.18) 


for the submodule of B that is generated by the union of the subsets By, ..., 
B, of B. By Theorem 9.26, the set (9.18) comprises all the elements b of B 
that have a (not necessarily unique) expression of the form 


b= bi +bo+...+dn (9.19) 
with b; in B; for 1 <i <n (Exercise 26). Given the existence of expressions 


of the form (9.19), their uniqueness is characterized as follows. 


PROPOSITION 9.38 
For a positive integer n, let Bi, ..., By be submodules of a module B. The 
following conditions are equivalent: 


(a) The submodule By + By+...+B, of B is the internal direct sum of its 
submodules By, ..., Bn; 


(b) Each element b of By +...+ By has a unique expression of the form 
b= b) +bg+...+ by 
with b; in B; for 1 <i<n; 
(c) The zero element 0 of B has a unique expression of the form 
O=z4+22+...+ Zn 


with z,; in B; forl1<i<n. 


MODULES 231 


PROOF The equivalence of (a) and (b) follows directly by Definition 9.36. 

Condition (c) is just a special case of condition (b). It remains to show that 
(c) implies (b). Suppose that (c) holds. In that case, z; = 0 for 1 <i<n. 
Suppose that an element 6b of B; + ...+ B, has expressions 


b=ay+...¢ 2, =Yit---+ Yn 


with x;, yj in B; for 1 <i<n. Then 0= (2; —y1) +...+(@n — yn). By (c), 
we have x; — yj = 2; = 0 for 1 <i <n, so the expression for b is unique. 


9.6 Free modules 


We now begin the study of finitely generated modules. Let R be a unital 
ring. Using the construction of Example 9.20, the underlying abelian group 
(R,+,0) becomes a left R-module rR, with action given by the multiplication 
in the ring. Direct sums of a finite number of copies of rR, and modules 
isomorphic to such direct sums, are very important. They are recognized 
with the help of Proposition 9.38, using the concept of linear independence 
introduced in the following definition. 


DEFINITION 9.39 (Free modules and linear independence.) Let 
A be a module over a unital ring R. Let | be a natural number, and let 
X = {x1,...,x1} be an l-element subset of A. 


(a) An expression 
O=7rya,+...+ 712) (9.20) 


of 0 as an R-linear combination of the elements of X is said to be 
nontrivial if there is at least one nonzero coefficient 11, ..., Tt. 


(b) The set X is said to be linearly independent (over R) if there is no 
nontrivial expression (9.20) of 0 as a linear combination of the elements 
of X. (In particular, the empty set is linearly independent.) 


(c) If X is a linearly independent generating set for the R-module A, then 
A is described as the free R-module over the generating set X. In this 
case, the set X or sequence x1, ..., x, is also known as a basis for A. 


Example 9.40 (Standard bases.) 
Let 1 be a positive integer. For a nontrivial unital ring R, write R! for the 


direct sum 
1 summands 


a > 
RR®...PRR 
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of 1 copies of the module rR. For 1 <i <1, consider the element 


slot é 
—~ 
= (Oh aeg 0s. y- Oyesn, 0) 
of R'. The sequence €}, ..., €7, or the subset 


EF, =f{e,|1<i< 
of R', are both known as the standard basis of R'. Now R-linear combinations 
of standard basis elements are easily computed: 
rey +... + rie, = ("1,---,71) 


for coefficients r1, ..., 7; in R. Thus the set EF; is linearly independent, and 
R! is the free R-module over the standard basis E;. For the ring R of real 
numbers, the standard basis E3 of R? is illustrated in Figure 9.1. 


FIGURE 9.1: The standard basis E3 of R°. 


The trivial R-module {0} is the free R-module over the empty generating 
set. Expanding the notation of Example 9.40, it is convenient to denote {0} 
by R°. General free modules over nonempty generating sets are described as 
follows. 


PROPOSITION 9.41 (Structure of free modules.) 
Let R be a unital ring. Let A be a free R-module over a generating set 
X = {x1,..., x1} with a positive number | of elements. 
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(a) A is the internal direct sum of the submodules R{x,}, ..., R{az}. 
(b) A is isomorphic to the module R! of Example 9.40. 
(c) Each element a of A has a unique expression as an R-linear combination 


QA=Tyay +... +712]. 


PROOF (a): For 1 <i < 1, each element of R{x;} is of the form r;2; 
for some element r; of R. Since X is linearly independent, the submodules 
R{x1},..., R{ay} of A satisfy the condition (c) of Proposition 9.38. Since the 
module A is generated by X, it becomes the sum A = R{x,}+...+ R{ay}. 
By condition (a) of Proposition 9.38, it follows that A is the internal direct 
sum of the submodules R{x1},..., R{ax}. 

(b): For each 1 <i <1, there is an isomorphism 


Go: RRO R{ahire ra; 


from the module rR to the submodule R{x;} of A. Note that as an abelian 
group homomorphism, the map g; is injective. Indeed rx; = 0 implies r = 0, 
since the set X is linearly independent. 

(c) now follows by condition (b) of Proposition 9.38 and the existence of 
the isomorphisms g; for 1 <7 < l. 


The importance of free modules stems from the following. 


THEOREM 9.42 (Universality property of free modules.) 

Let A be a free R-module over a finite generating set X. Let 7 : X — A denote 
the inclusion of X as a subset of A. Then for each function f : X — B from 
X to the underlying set B of an R-module B, there is a unique R-module 
homomorphism f : A— B with f = f oj. 


PROOF Suppose that X is the l-element subset {21,...,2,} of A. For 
natural numbers k not exceeding /, we will prove by induction on k that an 
R-module homomorphism f : A > B with f = f oj has to satisfy 


F(riti +... + 7e~tR) = 11 f (1) +... 1K f (ee) (9.21) 


for coefficients 71, ..., 7% in R. The induction basis is the case k = 0 of (9.21), 
namely f(0) = 0. This must hold if f : (A,+,0) — (B,+,0) is to be a group 
homomorphism. For the induction step, we have 


f(riv + wd + Te—-1Uk-1 + TELE) 


= f(ryay +... + rp_ite—1) + f (rere) 


= f(rigi +... + 7K—10R-1) + ref (xx) 
= rif (or) bre fet) ef (We) 
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The first two equalities hold since f is required to be a module homomorphism. 
The last equality holds by the induction hypothesis, and by the consequence 
f (xx) = f(ax) of the requirement f = f oj. 

Since A is generated by X, the case k = | of (9.21) defines the function 
f:A— B uniquely as 


FR ob ey) = iF St) Fs Sa ey 


Note that f is well defined, by Proposition 9.41(c). It remains to check that 
the function f actually is a module homomorphism. But for elements r;, s;, 
r of R, we have 


(Xo rai t s voi) =] = (Se + “2 = 3c + si) f(x) 


i=l i=1 i=l 


= Son Xi + Sif ( L;) -_ 


SI 
—~ 
s 
= 
VS 
SI 
—~ 
& 

2 
VS 


w=1 t=1 

and 

= 1 opel 1 

fA @» rst) =f, ( De rnin) = S- rrif (a4) 

i=l i=l i=1 
1 I 
= rfted=7F( orem) 
i=1 i=l 

as required. J 


Example 9.43 (Freeness of the group of integers.) 

Consider the abelian group Z as the free Z-module Z! over the standard 
basis FE, = {1}. Theorem 9.42 states that for each abelian group G, and for 
each element x of G, ie., for each function f : {1} > G;1 + a, there is 
a unique group homomorphism f : Z — G with f(1) = 2. This assertion 
is just the special case of Theorem 5.26 (page 107), the universality of the 
group of integers, in which the target group G is abelian. In the notation of 
Theorem 5.26, the group homomorphism f is exp,. 


As an application of Theorem 9.42, we may compute the endomorphism 
ring of each finite cyclic group. Note the choice of the unital ring R here. 


PROPOSITION 9.44 (The endomorphism ring of a cyclic group.) 
Let n be a positive integer. Then the map 


End(Z/n,+,0) > (Z/ny+,°);0 = (1) (9.22) 


is a ring isomorphism. 
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PROOF The abelian group (Z/n,+,0) is the free Z/,-module over the 
standard basis {1}. Each endomorphism @ of the abelian group (Z/n,+,0) is 
a Z/n-module homomorphism 6: Z/n — Z/n. By Theorem 9.42, each such 
module homomorphism is uniquely specified as the extension f of a function 
f : {1} — Z/,. Thus the map (9.22) is a bijection. It is straightforward to 
verify that it is a ring homomorphism (Exercise 29). (] 


9.7 Vector spaces 


Modules over a field are traditionally known as vector spaces. Module 
elements are known as vectors, and submodules are subspaces. Vector spaces 
enjoy special properties that are not shared by general modules. In particular, 
it will transpire that every (finitely generated) module over a field is free. 

For a general unital ring R, Definition 9.39(b) introduced the concept of 
linear independence for an [-element subset X = {21,...,2:} of an R-module 
A. The stipulation that X has | elements means that there are no repeats in 
the sequence x1, %2,...,2 1. For the opposite concept of linear dependence, we 
do need to admit the possibility of repeats. 


DEFINITION 9.45 (Linear dependence.) Let A be a module over 
a unital ring R. Consider a sequence £1,%2,...,% of elements of A. The 
sequence is said to be linearly dependent if there is an expression 


O=rya, +... +7Terp 


of 0 as an R-linear combination of the elements of the sequence, with at least 
one of the coefficients r1, ..., rx in R being nonzero. 


Suppose that A is a module over a nontrivial unital ring R. Let x,...,7% 
be a sequence of elements of A. There are three trivial ways for the sequence 
to be linearly dependent: 


e If x; = 0 for some 1 <i < k, then the sequence is linearly dependent by 
virtue of the expression 0 = 1 - xj; 
e If the sequence contains a linearly dependent subsequence 2;,, ..., ©i,, 
say 
0 = Ty Vi, oa ae Tj, Vi, 
with some r;, # 0, then the same expression serves to show that the 
original sequence 21,..., 2, is linearly dependent; 


e If the sequence has a repeat, say x; = 2; for 1 <i <j <-k, then it is 
linearly dependent by virtue of the expression 0 = 1- a; + (—1)- aj. 
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A sequence 21, ..., 2 is linearly independent if it is not linearly dependent. 
Since there are then exactly | elements in the sequence, repeats being excluded, 
we recover the concept of linear independence of the set {21,...,2,} from 
Definition 9.39(b). 

The following result gives an indication of the special properties that hold 
for modules (vector spaces) over fields. 


PROPOSITION 9.46 (Linear dependence and linear combinations.) 
Let A be a vector space over a field F. Let x1, ..., x% be a sequence of 
nonzero elements of A. Then the following conditions are equivalent: 


(a) The sequence 21, ..., &, ts linearly dependent; 


(b) An element x; of the sequence, with j > 1, is a linear combination of 
the earlier members x1, ..., &j~1 of the sequence. 


PROOF (a) implies (b): Since x, ..., x, is linearly dependent, there 
is a relation 
O= ray t+... + 7j-12j-1 +752; 


with r; 40. Then 
Lj= —rj 123 cue are Tae ey Se 
expresses x; as a linear combination of the earlier members of the sequence. 
(b) implies (a): Suppose that 


Lj = M21 +... 475-12%5-1 . 


Then 
0= | aS ba Poet en Tj-12j-1 + (-l1)a; 


expresses 0 as a linear combination involving the nonzero coefficient —1. J 


Example 9.47 (Linear dependence without a linear combination.) 
In the Z-module Z/4 of integers modulo 4, take x; = 2 and x2 = 1. Then the 
sequence £1, £2 is linearly dependent, since 0 = 1- a1 + 2-22. Nevertheless, 
the element x2 is not a linear combination (multiple) of x. 


For the remainder of this section, we will consider modules (vector spaces) 
over a field F’. The following theorem uses Proposition 9.46 to show that every 
finitely generated vector space is free. The name comes from the way that a 
generating set may be pruned down to a linearly independent generating set, 
a basis. 
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THEOREM 9.48 (Pruning Theorem.) 

Let V be a finitely generated vector space over a field F. Suppose that Y = 
{yi,---,Ym} is a set of nonzero vectors in V that generates V. Then Y has 
a linearly independent subset {yi,,...,Yi,} which forms a basis of V. 


PROOF Suppose that X = {x1,...,2,} is a k-element subset of Y which 

still generates V. If this subset X is already linearly independent, it forms 
the required basis. Otherwise, x21,...,2% is a linearly dependent sequence 
of nonzero vectors. According to Proposition 9.46, there is some vector 2; 
in the sequence, with 7 > 1, that is linearly dependent on its predecessors. 
Then X’ = {21,...,%j;-1,%j41,..-, vg} is a (k—1)-element subset of Y which 
still generates V. The pruning process, passing from a generating set X to 
a smaller generating set X’, may be continued until a linearly independent 
generating subset is obtained. 


Example 9.49 (Eighth roots of unity.) 
Consider the set Y = {w” | 1 <r < 8} of 8-th roots of unity, the complex 
roots of the polynomial X® — 1 in Q|X] — Example 8.24, page 198 — with 


= 00s (7) + isin (7) = ty 
WwW = COs 4 7sin rr = . 


These roots are displayed in Figure 9.2. 


FIGURE 9.2: Eighth roots of unity. 
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The field Q(w) is the splitting field for X® — 1 over Q. Using the notation 
of Example 9.20, the extension field Q(w) of Q forms a module or vector 
space gQ(w) over Q. The set Y spans the Q-vector space Q(w). (In fact, 
Q(w) may also be characterized as the subspace of gC generated by Y.) By 
Theorem 9.48, Y may be pruned down to a basis for Q(w) over Q. Starting 
with the sequence 

ds toe ee cto ite? a, 


the elements w’ = —w°, w® = —w?, w® = —w, wt = —1 are pruned in turn. 


This leaves the linearly independent set {1,w,w,w?} as a basis for the vector 
space Q(w) over Q. 


THEOREM 9.50 (Exchange Theorem.) 

Let V be a vector space over a field F’, and let | be a positive integer. Suppose: 
(a) X = {21,..., 2%} is an l-element basis of V, and 

(b) Y = {y1,---; Ym} is an l-element, linearly independent subset of V. 
Then V is generated by Y. 


PROOF For each 0 < k < I, we will prove that there is an injective 
function f:{k+1,...,J} — {1,...,0} such that the set 


{W159 <5 Yes Rp epas sss FOS (9.23) 


generates V. The claim is true for k = 0, with f as the identity function on 
the set {1,...,/}. Suppose 


diate URS Eee ath 
generates V, with an injective function 
g: {k,...,} 3 {1,..., 0}. 
Then the vector y, of V may be written as a linear combination 
Ue Ti Pee Pela Ee gay Fee Peay (9.24) 


Now ry, = --: = rT; = 0 would contradict the linear independence of Y, so 
there is an integer k <m <1 with r,, 4 0. Rewriting (9.24) as 


Yk =T1Y1 + +e FT e-1Yk—-1 TTL g(k) H+ PmXg(m) +++. + TiLg(l) ; 
we see that %g(m) is expressed as the linear combination 


el =a = 
Pm TUYL m6) Um TR-1YkK-1 TT Vie 


-1 -1 241, -1 
—Tin ThLg(k) — +++ — Tm Pm—-12g(m—1) — Pm Tm41XLg(m41) — +++ — hm TZ g(h) - 
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Since the set {y1,.--,Yk—1,Tg(k)s---»Tg1)} generates V, it is apparent that 


the set {yis.25+, Yes Ta Gep1)9--+ +4 Vem) Vet) i= 9 %aqy f also-generates: V. 
Define an injective function h: {k+1,...,J} - {1,...,]} by 


oe ae ifi<m; 


g(t) ift>m. 


Then {y1,---,Yk;n(k+1)>-++> Laci} Generates V, as required to complete the 
inductive proof. The statement of the theorem follows, since Y is just the 
case k = 1 of (9.23), with f: @ — {1,...,U}. 


COROLLARY 9.51 


Let V be a finitely generated vector space over a field F. Then any two bases 
of V have the same number of elements. 


PROOF Suppose that X = {x,...,a;} and Y’ = {yi,...,Yn} are bases 
of V over F’, with | and n elements respectively. If 1 4 n, say | < n, then 
Theorem 9.50 shows that the linearly independent subset Y = {yi,..., yw} of 
Y’ already generates V. The expression of the nonzero vector y+ as a linear 
combination of 41,...,y, would then violate the linear independence of Y’. (] 


DEFINITION 9.52 (Dimension of a vector space.) Let V be a finitely 
generated vector space over a field F. Then the dimension dimp V of V over 
F is the number of elements in a basis of V over F. 


Example 9.53 
In Example 9.49, dimg Q(w) = 4. l] 


Example 9.54 
For a field F' and positive integer n, the vector space F” has n elements in 
its standard basis E,, (Example 9.40). Thus dimp F” = n. 


Since finitely generated modules over a field F' have finite dimension, they 
are usually known as finite-dimensional vector spaces. If a vector space is not 
finite-dimensional, it is described as being infinite-dimensional. For the proof 
of the following, see Exercise 34. 


PROPOSITION 9.55 
Let V and W be finite-dimensional vector spaces over a field F. Then V and 
W are isomorphic if and only if they have the same dimension. 
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9.8 Abelian groups 


This section analyzes the structure of finitely generated abelian groups. 
Each such group is an internal direct sum of certain cyclic subgroups, given 
in terms of an ascending chain of ideals of Z — compare (8.3), page 188. 


THEOREM 9.56 (Structure of finitely generated abelian groups.) 
Let A be a finitely generated, nontrivial abelian group. Then there is a finite 
ascending chain 


Jy S Jo S cera Ji-1 iS JI (9.25) 


of proper ideals of Z such that A is isomorphic to the direct sum 
Z/Ji OZ/JIo8...0LZ/A10Z2/N (9.26) 


of nontrivial cyclic groups. 


PROOF Consider the set 
Sy = {|X| < co | ZX = A} 


of sizes of the finite subsets X of A that generate A. Since A is finitely 
generated, the set 5 of positive integers is nonempty. By the Well-Ordering 
Principle, the set S; has a least element /. This integer / will become the 
number of ideals in the chain (9.25). The proof proceeds by induction on J. 
If 1 = 1, the group A is cyclic, and the theorem is immediate. Thus for the 
inductive proof, we proceed to the case | > 1, and assume that the theorem 
is true for all abelian groups generated by fewer than | elements. 

Suppose that some generating set X = {x ,...,x,} of minimal size | is 
linearly independent, so that A is free. Proposition 9.41 then gives 


A2Z/J, pZ/ Jo DY eas WABI 
with J; =--- = J; = {0}, and the theorem is proved directly. 
In the remaining cases, each generating set X = {21,...,2,} of minimal 


size | satisfies a relation 
O=ryay,t+...+ 721 (9.27) 


with at least one nonzero coefficient integer r;. Multiplying (9.27) by —1 if 
necessary, we may assume that there is a strictly positive coefficient. Let S2 be 
the nonempty set of all such positive coefficients, taken over all the generating 
sets of minimal size |. By the Well-Ordering Principle, the set Sz has a least 
element. This least element appears as the coefficient r; in the relation (9.27) 
on a generating set X = {x1,..., x7}. 
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For 1 <i < l, the Division Algorithm with divisor r; gives r; = qr) + 1%, 
with remainder 0 < ri < r;. Rewriting the relation (9.27) as 


0= T{Xy see TE-1Vj-1 Ti Ly TE41Vi4+1 wae TX] 
/ 
HEH rr, see Ty-1%j-1 (Girl + 1); + E41 Viti +... FTL] 
/ 
STM eee TM-1Uj-1 TTX M441 Vi41 wae rar + girs) 


gives r/ as a coefficient in a relation on the generating set 
{His ty Baye Vit eet Geta} 


— note that x; = (—q;)a:+ (2: +q@2;). By the choice of r; as the least positive 
coefficient, we must have ri = 0, so that rj = qirt. 
Now consider the minimal generating set X’ = {x,..., 2-1, y} with 


YWrHNnt1 +... + UN-121-1 + XT. 
The relation (9.27) is rewritten as 


O=ryay... +7141 + 7121 
= QMrt1 +... + @-17iL1-1 + TIL] 
=ri(qtit...+q@-121-1+ 21) =riy, 


so the element y; has order r; in A. Setting J; = 11Z, we have Z/J; = Z{y}. 
Let B be the subgroup of A generated by {21,...,2;-1}. Note that A = 

B+Z{y}. Suppose that 
O0O=b+ SLY (9.28) 


with b = s;a7, +...+ 8)-12)_; in B and s;y, in Z{y,}. Use the Division 
Algorithm to write s; = pr; +r with 0 <r <r. Recalling that r;y = 0, the 
relation (9.28) takes the form 


0 = b4 sey = ($141 +... + 8;-141-1) + rw. 


By the choice of r; as the least element of the set S2 of positive coefficients, 
it follows that r = 0, so b = 0 = 5)y, in (9.28). By Proposition 9.38(c), the 
group A is the internal direct sum of B and Z{y}. 
Since B is generated by | — 1 elements, the induction hypothesis yields an 
ascending chain 
Ji — Jo Pea 5 Ji-1 (9.29) 


of ideals of Z such that B is isomorphic to the direct sum 
Z[J, @Z/JIo®...BZ/I-1 


of nontrivial cyclic groups. It follows that A, as the internal direct sum of B 
and Z{y:}, is isomorphic to the direct sum (9.26). 
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By Corollary 9.35, there is a generating set {y1,...,yi-1} of B such that 
Z/J, = Z{y:} for 1 <i<l-1. If B is free, then Jj_1 = {0}, so the chain 
(9.29) extends to (9.25). If B is not free, then Jj_1 = sZ with 0 < s =qri+r’ 
and 0 < r’ < r,. Consider the generating set Y = {y1,.--,y—1, @yi-1 + wi} 
for A — note y = (—q)yi-1 + (m-1 + m). Since 0 = sy_1 = Tim, we have 


0=(qritr’)y-1 trim =r’ yi-1 + ri(qy-1 + 2) 


as a relation on the minimal generating set Y of A. By the choice of r; as the 
least element of the set Sg of positive coefficients, it follows that r’ = 0. Thus 
s = qr lies in Jj, and the chain (9.29) extends to (9.25) in this case as well. [] 


Example 9.57 

Consider the abelian group A = C3 x C4 x Cs x Cg. By the Chinese Remainder 
Theorem (Theorem 5.24, page 105), C3 x C4 x Cs & Coo, so A = Coo X Ce. 
Theorem 9.56 gives A = Z/J, ®@ Z/Jo with J, = 60Z and Jo = 6Z. 


Example 9.58 
Consider the subgroup 


N = {(0, 10x, 22) | in Z} 


of Z°. Let A be the quotient group Z?/N. For the standard basis E3 = 
{e1,€2,e3} of Z, consider the cosets x; = e; + N in A for 1 <i < 3. The 
generating set X = {x1, 22,23} of A is minimal, and satisfies the relation 


10r2 + 2x3 = (0,10,2)+N=N=0. (9.30) 


Using the notation from the proof of Theorem 9.56, the least element of the set 
Sy of positive coefficients is r3 = 2, the coefficient of x3 in the relation (9.30). 
If B is the subgroup of A generated by {x1, x2}, we have A= BO@Z{ys} with 
y3 = 5a + #3 and Z{y3} = Z/2. In turn, B = Z@ Z (Exercise 36). Thus the 
chain of ideals in the description of A provided by Theorem 9.56 is J; = {0}, 
Jz = {0}, and Js = 2Z. 


Example 9.59 
Consider the subgroup 
N = {(0,10z,2y) |x, y in Z} 


of Z°. Let A be the quotient group Z°/N. Then the chain of ideals in the 
description of A provided by Theorem 9.56 is Jy = {0}, Jo = 10Z, and J3 = 2Z 
(Exercise 37). (] 


9.9 


1. 
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Exercises 


Determine all the endomorphisms of the group (Z/3,+,0) of integers 
modulo 3 under addition. 


. Let A and B be abelian groups. Show that the maps 


p: Ax B= Ax B;(a,b) & (a,0) 


and 
0:Ax B- Ax B;(a,b) + (0,b) 


are endomorphisms of A x B. 


. Let A be an abelian group. Show that the map 


7: AxA—A-x A;(a,b) & (b,a) 


is an endomorphism of A x A. 


. Determine all the injective endomorphisms of the Klein 4-group. 


. Let r be an element of a ring R. Which ring property in Definition 6.1 


(page 127) ensures that (9.4) is an endomorphism of the additive group 
(R,+,0)? 


. Let (G,-,e) be a group. Consider the group Z of integers. For two 


homomorphisms 6: Z — G and y: Z — G, define the componentwise 
product 
0-p:Z2> Gin O(n)y(n). 


Show that G is abelian if and only if the following condition is satisfied: 


For all homomorphisms 6 : Z — G and y : Z — G, the 
componentwise product 0-y: Z — G is a homomorphism. 


. Let A be an abelian group of order 2. 


(a) Show that there are just 2 endomorphisms of A, the zero map 0 of 
(9.1) and the identity map 1 of (9.2). 


(b) Show that 1+ 1 =0 in the endomorphism ring End(A). 


(c) Conclude that the ring End(A) is isomorphic to the ring (Z/2, +,-) 
of integers modulo 2. 


. In Example 9.16, verify that the map 


F > End (Fy,+);A% oy 


is a unital ring homomorphism. 
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In Example 9.17, verify that the map 
C > End (C3, +);A + 6) 
is a unital ring homomorphism. 


Let A be an abelian group. Suppose that A forms a Z-module (A, +,¢) 
with structure map 0 : Z— End A. Prove that o,(a) is the multiple na 
for each integer n and element a of A. 


Let A be an abelian group, interpreted as a Z-module according to 
Example 9.18. Show that a subset B of A is a subgroup of A if and only 
if it is a submodule of A. 


For a field F, consider the module Fy of 2-dimensional column vectors 
given by the scalar action of Example 9.16. For each 2-dimensional row 
vector [fi fo] over F', show that the solution set 


o={ [2] [ua f]=o} 


of the homogeneous linear equation 
fitr + fore =0 
is a submodule of F}. 


Continue the notation of Exercise 12. Let k be a nonzero element of the 
field F’. Under what conditions is the solution set of the inhomogeneous 
linear equation 

fivr + fot =k 
a submodule of F}? 


Let (R,+,-) be aring. A subset J of R is said to be a left ideal of R if 
it is a subgroup of (R,+,0) satisfying the left absorptive property 


jin J implies r-jin J 


for each r in R. Take R to be a left R-module, as in Example 9.20 with 
A= R. Show that a subset B of R is a submodule of R if and only if it 
is a left ideal of R. 


Let R be the ring R3 of 2 x 2 real matrices, under the usual addition 
and multiplication of matrices. Consider the subset 


= {foe 


Ls yin r} 


of R. 


16. 


17. 


18. 
19. 


20. 
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22. 


23. 
24. 
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(a) Show that J is a left ideal of R. 
(b) Show that J is not an ideal of R. 
Suppose that X is a subset of a submodule B of a module A over a ring 


R. Write out a formal proof, by induction on the natural number n, 
that each R-linear combination 


T1 Bie Flt on 


of elements x1,..., &, of X, with coefficients r1, ..., r, from R, lies in 
the submodule B. 


Consider the abelian group Z of integers as a unital Z-module. Show 
that 
Z{a,b} = gcd(a, b)Z 


for nonzero integers a and b. 
Give a direct proof of Corollary 9.27 (without using Theorem 9.26). 


Let R be a nontrivial ring. Consider the polynomial ring A over R in an 
indeterminate Y as an R-module (A,+, R), according to Example 9.28. 
Let 


X = {pi(Y),...,pr(Y)} 
be a finite subset of A. 
(a) Let M be the maximum of the degrees deg p,, ..., deg p, of the 
polynomials in the set X. Consider the submodule RX of A that 


is generated by X. Show that no polynomial p(X) in RX has a 
degree deg p which exceeds M. 


(b) Conclude that the R-module A is not finitely generated. 


Show that the image of a module homomorphism is a submodule of its 
codomain. 


Let (A,+,Z) and (B,+,Z) be Z-modules. Show that a given function 
f:A-— Bisa Zmodule homomorphism if and only if it is an abelian 
group homomorphism. 


Verify the claim of Example 9.33. 

Verify the claim of Example 9.37. 

Let A and B be abelian groups. Define group homomorphisms 
a:A—A®B;at (a,0) 


and 
B:B>A@®B;b4 (0,0). 
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Let C be a third abelian group, the codomain of group homomorphisms 
f:A—-Candg:B-C. Show that there is a uniquely defined group 
homomorphism h: A® B— C withhoa=f andhoB=g. 


Define group homomorphisms 
a:Z—>Z x Z;n (n,0) 


and 
B:Z>32Zx Zn (0,n). 


For the symmetric group S3 = {0,1,2}!, define group homomorphisms 

f:Z— S3;n' (0 1)” 
and 

g:Z— S3:n (1 2)”. 
Show that there is no group homomorphism h : Z x Z — $3 with 
hoa=f andhof=g. (Hint: In Z x Z, the equation 

(1,0) + (0,1) = (1,1) = (0,1) + (1,0) 
holds, while 
(0 1)0(1 2) 4 (1 2)0(01) 

in S3.) 


For a positive integer n, let B,, ..., B, be submodules of a module B. 
Verify that the set of all elements b of B that have an expression of the 
form 

b=b) + bo +...4+ by 


(with b; in B; for 1 < i <n) is the submodule of B generated by the 
union of the subsets B,,..., B, of B. 


Let X = {1,...,2,} be a basis for a free module A over a unital ring 
R. For 1 <i <1, define 


Yi=Ut+...+ 7. 
Show that Y = {y1,..., yi} is a basis for A. 


Let R be a nontrivial unital ring. Let / and m be integers. Show that 
the two R-modules R!' 6 R™ and R'+™ are isomorphic. 


Verify that the map (9.22) is a ring homomorphism. 


Let n be a positive integer. Show that the abelian group Cy, x C,, has 
n* endomorphisms. 


3l. 


32. 


33. 


34. 


35. 


36. 


37. 
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In the Z-module Z/¢ of integers modulo 6, take 7; = 2 and x2 = 3. 


(a) Show that the sequence x1, 22 is linearly dependent. 


(b) Show that neither of the elements 21, x2 is a linear combination of 
the other. 


Consider 
= (=) +isin (=) 
Ww = cos 3 3 


as a sixth root of unity. Find a basis for the vector space Q(w) over Q. 


cos an +isi an 
w= ns asim = 
5 5 


as a fifth root of unity. Find a basis for the vector space Q(w) over Q. 


Consider 


Prove Proposition 9.55: Let F be a field. 


(a) Let V be a finite-dimensional vector space over F’. If dimp V = l, 
let X = {x1,..., x21} be a basis for V. Define 


fi: XS Bry e 
and 
g: by > X;e,h aj. 


Use the universality of free modules (Theorem 9.42) to build the 
respective extensions f : V — F” and g: F” — V. Show that f 
and g are mutually inverse vector space isomorphisms. (Hint: The 
unique homomorphic extension idx of idx is idy.) 

(b) Let J and m be positive integers. If h: F! > F™ is a vector space 
isomorphism, show that h(E;) is a basis of F’™. Use Corollary 9.51 
to conclude that | =m. 


Let A be the abelian group C2 x C3 x C4 x Cg. Determine the chain 
(9.25) of ideals in the description of A provided by Theorem 9.56. 


In Example 9.58, show that each element (a), a5,a3) + N of A has an 
expression of the form 


(a1, @5,a3) + N = (aj, a2,0) + (0,5a3, a3) + N 


121 + AgX2 + a3y3 


l| 


with integers a, and ag. Conclude that the subgroup B is isomorphic 
toZ@Z. 


In Example 9.59, show that the group A is isomorphic to Z®Z/19 BZ/2. 
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Consider the subgroup 


N = {(0, 22, 2y)|2, y in Z} 


of Z°. Let A be the quotient group Z?/N. Find the chain (9.25) of 
ideals in the description of A provided by Theorem 9.56. 


(a) Show that the sets Z and Z? are isomorphic. 
(b) Show that the groups Z and Z? are not isomorphic. 


9.10 Study projects 


1. 


Partitions and p-groups. A partition (or more explicitly, an integer 
partition) is an expression of a positive integer n as a sum 


n=nytnoet...+¢n, (9.31) 


of positive integers. The integers n; are known as the parts or summands 
of the partition. Conventionally, the summands may be arranged in 
decreasing order: 

my 2Ng2 72M. (9.32) 


The number | of summands is known as the length of the partition. The 
integer n is known as the sum of the partition. 


FIGURE 9.3: The partition 4+4+3+1+1. 


It is sometimes helpful to visualize a partition (9.31) as a series of | 
columns, of respective depths given by the parts of the partition, as 
illustrated in Figure 9.3. 
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For a prime number p, a finite group is said to be a p-group if its order is 
a power p” of the prime number p. Here, we are concerned with abelian 
p-groups. Each partition (9.31) determines the p-group 


Z/p ZG Z/p’Ze---OZ/p"Z. 


The Structure Theorem 9.56 for abelian groups then yields the ascending 
sequence 


p'Z « pez « s agchte p'Z 
of ideals of Z corresponding to the decreasing sequence (9.32) of parts 


of the partition. A group A is said to be an elementary abelian group if 
it is isomorphic to a finite power Ch of a cyclic group of prime order p. 


(a) Show that the underlying group of a finite-dimensional vector space 
over the field Z/, is an elementary abelian p-group. 


(b) Show that every elementary abelian p-group A of order p! is the 
underlying group of an /-dimensional vector space over Z/,. 


(c) Show that an elementary abelian p-group of order p! has p! — 1 
elements of order p. 


(d) Show that an abelian group of order p! is elementary abelian if it 
has p!' — 1 elements of order p. 


(e) Show that an elementary abelian p-group of order p! corresponds 
to the partition /=1+1+...+1. 


(f) Let A be an abelian p-group of order p”. Using the notation of 
(9.3), consider the endomorphism pu, : A + A;at> pa of A. Show 
that A corresponds to a partition of n with length / if and only if 
the subgroup 


Ker pp, = {a | pa = 0} 
is an elementary abelian p-group of order p!. 


(g) Use induction on n to show that two p-groups A and B of order p” 
are isomorphic if and only if they correspond to the same partition 
(9.31) of n: 


(i). If A and B are elementary abelian, show that the claim results 
directly from (f). 
(ii). Otherwise, consider the subgroup 


pA = {pa | ain A} 
of A. If A corresponds to (9.32) with 
ny >t > Mm > Mme = = M=1, 
show that pA corresponds to the partition 
(ny —1)+...+ (Mm — 1) 


of n—l. 
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(h) Determine all the integer partitions of 5. 

(i) Determine representatives for each of the isomorphism classes of 
abelian groups of order 32. 

(j) Determine the isomorphism class of the group of units (Z/s51,-,1)* 
of the monoid of integers modulo 51 under multiplication. 

(k) Do the monoids Z/s5; and Z/15 x Z/5 have isomorphic groups of 
units? 


2. Algebraic and transcendental numbers. A complex number z is 


said to be algebraic if the field extension Q(z) is a finite-dimensional 
vector space over Q. Otherwise, x is described as transcendental. For 
example, it is known that the real numbers e and 7 are transcendental. 


(a) Show that \/5 is algebraic. 

(b) Show that « is algebraic if and only if it is the root of a certain 
polynomial p(X) in Q[X]. 

(c) Show that /3 + V5 is algebraic. 


(d) Consider successive field extensions F ~ K — L. Suppose that K 
is a finite-dimensional vector space over F’, with basis {k1,..., km}. 
Suppose that L is a finite-dimensional vector space over K, with 
basis {1,,...,l,}. Show that L is a finite-dimensional vector space 
over F’, with basis 


NS 


{ile (Les m, lagen}. 


Show that the set of algebraic numbers forms a subfield A of C. 
(Hint: If the dimensions dimg Q(z) and dimg Q(y) are finite, show 
that dimg/,) Q(x, y) and dimg Q(z, y) are finite. ] 


— 
lo) 
Ne 


. Algebraically closed fields. A field F is said to be algebraically closed 


if it satisfies one of the following conditions: 


(a) Each nonconstant polynomial f(X) in FX] has a root in F; 
(b) Each nonconstant polynomial f(X) in FX] splits over F; 
(c) If E is an extension of F' for which dim, E£ is finite, then E = F. 


Show that these conditions are equivalent. 


Now, if you know a little complex analysis, recall Liouville’s Theorem: 


If a function g : C > C can be expanded as a power series 


co 
S- An(z — Zo)” 
n=0 


about each point zo of C, and there is a real number M such 
that |g(z)| < M for all complex numbers z, then g: C > C is 
a constant function. 
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Use Liouville’s Theorem to derive the so-called Fundamental Theorem 
of Algebra: The field C is algebraically closed. [Hint: If f(z) 4 0 for all 
complex numbers z, apply Liouville’s Theorem to g(z) = 1/f(z).] 


9.11 Notes 
Section 9.6 
For certain noncommutative rings R, there may be an isomorphism 
Rix R™ 


with | Am. 


Section 9.7 


The fundamental Corollary 9.51 may also be proved using a result from 
the elementary theory of linear equations, that a homogeneous system of [ 
given equations in n unknowns has a nonzero solution if 1 < n. Traces of 
that method may be observed in the proof of Theorem 9.50, for example the 
selection of %g(m) as a “pivot” element in (9.24). 


Section 9.10 


J. Liouville was a French mathematician who lived from 1809 to 1882. 
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Chapter 10 


GROUP ACTIONS 


The importance of groups stems from the general way that they capture the 
concept of symmetry, as described in this chapter. In particular, each group 
contains various internal symmetries that constrain the structure of the group. 


10.1 Actions 


For a set X, recall that the set of all bijective maps from X to X forms 
a group X! of permutations under composition, the symmetric group on X 
(compare Section 2.7, and Exercise 13 in Chapter 4). For a group G, Cayley’s 
Theorem (page 110) shows that there is an injective group homomorphism 


A:G—>Glygr Ag (10.1) 


from G to G!. In order to study general symmetry, we replace the group G! 
in (10.1) by the group X! of permutations of a general set X. 


DEFINITION 10.1 (Group action, permutation representation.) 
Let X be a set, and let G be a group. Then the group G is said to have a 
(left) (group) action (X,G,.) on the set X, or to act on the set X, if there 
is a homomorphism 

A:G—XIgt Ag (10.2) 
from G to X!. The homomorphism X appearing in (10.2), or the full structure 
(X,G,), is said to be a permutation representation of the group G. 


Example 10.2 (Regular action.) 

As a permutation representation of a group (G,-,e), the homomorphism 
(10.1) from Cayley’s Theorem, with \,(x) = g- a for g and z in G, is called 
the (left) regular permutation representation. J 


Example 10.3 (Natural action of a symmetric group.) 
For each set X, the identity map idx; : X! — X! gives the natural action 
(X, X!,idx1) of the symmetric group X! on X. 
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Example 10.4 (Matrix action.) 
Left multiplication (2.6) by invertible real 2 x 2 matrices gives an action 


L: GL(2,R) > (R5)!; A r> La 


of the real general linear group GL(2,R) on the set R4 of 2-dimensional real 
column vectors. In this example, it is possible to replace the field R of real 
numbers by any other field F’, giving an action 


Li GL(2,F) 3 (FAK La (10.3) 


of the general linear group GL(2, F’) of invertible 2 x 2 matrices over F’ on the 
set Fi} of 2-dimensional column vectors over F. 


A group G is defined by the multiplication, identity, and inversion. Given 
a set X, and maps AJ: X — X for each element g of G, the group properties 
may be used directly to confirm the presence of an action. 


PROPOSITION 10.5 (Conditions for a group action.) 

Let X be a set and let (G,-,e) be a group. Suppose that a map Ag: X > X 
is defined for each element g of G. Then there is a group action (10.2) of G 
on X if and only if 


(a) Ae(x) = x and 
(b) Ag-n(x) = Ag (An(z)) 
for x in X andg,hinG. 


PROOF First, suppose that there is an action (10.2). Since A: G— X! 
is a group homomorphism, ». is the identity element idx of X!, so that (a) 
holds. Again, for elements g and h of G, we have X,4.n = Ag o An, so that (b) 
holds. 

Conversely, suppose that (a) and (b) hold. In particular, (a) shows that 
Ae = idx. Then for each element g of G, condition (b) gives 


Ag 0 Ag-1 = Ag.g-1 = Ae = idx 
and 

Ag-1 ie) Ng = Ag-}-g = Ne = idx 3 
so that A, is invertible. In particular, there is a map 


A:G— XI; gr Ag. (10.4) 


Condition (b), in the form Ag.n = Ag 0 An, shows that the map X of (10.4) is 
a semigroup homomorphism. By Proposition 5.5 (page 96), it follows that 
is a group homomorphism, so that (10.4) becomes an instance of (10.2). J 
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COROLLARY 10.6 
Let x be an element of a set X with a group action (X,G,A). Define 


Gz={g in G|A,(z) =a}. (10.5) 
Then Gz is a subgroup of G. 
PROOF By Proposition 10.5(a), the identity element e of G lies in Gz. 


By Proposition 10.5(b), the set G,, is closed under multiplication. Finally, if 
g lies in G,, then 


Ape) SA (A, ee) SA) Se 
by Proposition 10.5(b), so that G, is closed under inversion. l] 
DEFINITION 10.7 (Stabilizers.) Suppose (X,G,.) is a permutation 


representation. Let x be an element of the set X. Then the group G, of (10.5) 
is called the stabilizer Stab)(x) of x in G. 


The notation G of (10.5) makes no specific mention of the representation 
A. Should reference to 4 be required, the notation Stab) (a) of Definition 10.7 
may be used. 


Example 10.8 (Trivial representations.) 

Let X be a set, and let G be a group. For z in X and g in G, define €,(x) = =. 
The conditions (a) and (b) of Proposition 10.5 are trivially satisfied. The 
representation (X,G,¢) is called the trivial representation of G on X. The 
whole group G is the stabilizer of each element of X. 


Example 10.9 (Inner automorphisms.) 
Let (G,-,e) be a group. For g in G, define 


T]:1G—>G;rrg-r-g'. (10.6) 


Note that 


and 
Tgn(X) = (g-h)-@- (gh) =g-h-a-h-*- go! = 14(ta(x)) 
for g, h and x in G. Thus by Proposition 10.5, 


T:G—>Ghgh Ty (10.7) 
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is an action of the group G on the set G,, the conjugation action. In particular, 
each map Tg : G — G is bijective. Moreover, 


1 —1 


SGP g- gsysg =a) 4) 


Tj(z-y)=Qg-E-Yy-g” 


for x and y in G, so that Tyg : G — G is a group homomorphism. The maps 
Tg are known as inner automorphisms of the group G. (] 


DEFINITION 10.10 (Center, conjugation.) Let G be a group. 


(a) The kernel of the group homomorphism (10.7) is the center Z or Z(G) 
of the group G. 


(b) The image 7(G) of the group homomorphism (10.7) is known as the 
inner automorphism group Inn(G) of the group G. 


(c) For a specific element g of the group G, the map 
™T]:G—>G;rrg-x-g' 
is known as conjugation by g. 
(d) For a specific element x of the set G, the stabilizer 
Colt) ={e i G|x,(2)=2} 


of x in the conjugation action is known as the centralizer of x in G. 


10.2 Orbits 


Let (X,G,A) be an action of a group G on a set X. For an element x of 
X, the orbit of x is the set 


Aa(#) = {Ag(z) |g in G} 


of images of x under the actions \, of the elements g of G. 


Example 10.11 (Plane rotations.) 
For each real number 6, define a map 


vay 1, | cos@ —sin 6] | x1 
pe a Ra 7] a ee ey | ; 


Note that for an angle @ (in radians), the map Kg : R} — R} rotates the plane 
R34 counterclockwise by @ about the origin O. Now (IR},R,«) is an action of 
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the additive group (R,+,0) of real numbers (Exercise 15). Then for a point 
P in the plane R3, the orbit kg(P) is the circle around the origin through the 
point P (Figure 10.1). If the Sun is located at the origin O, and P represents 
a planet, then the mathematical orbit kg(P) of P represents the astronomical 
orbit of the planet around the sun. 


R5 v2 4 
p=|*1 
v2 
e@ es 
O 71 


kr (P) 


FIGURE 10.1: The orbit Kg(P) of a point P in the plane R}. 


A group action may be trimmed down to a smaller action in two ways: by 
passing to a subgroup of the group, or by discarding some of the orbits. 


DEFINITION 10.12 Let (X,G,.) be an action of a group G ona set X. 
(a) For a subgroup H of G, the action (X,H, 2) given by the restriction 
A: H > X!:he r), 
to the subgroup H of the group homomorphism 
A:G—> XIgh rg 
is called the restriction of the action (X,G, A) to the subgroup H. 


(b) Let Y be a union of orbits in the action (X,G,X). Then the action 
(Y,G, 2) given by 
Ag Y > Ys yr Ag(y) 
for each g in G is called the restriction of the action (X,G,A) to the 
subset Y of X. 
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Example 10.13 (Right cosets.) 

Let H be a subgroup of a group G. Consider the restriction (G, H, X) to H of 
the left regular representation of G (compare Example 10.2). Then for each 
element x of G, the orbit A;7(x) is the right coset Ha. Now if K is a subgroup 
of G that contains H, the action (G, H, X) may be restricted to (K, H, A). [] 


For a subgroup H of a group G, the right cosets Hx of H in G are the 
classes of an equivalence relation on G (recall Proposition 4.49, page 83). The 
following proposition states that the orbits of a general group action (X, G, \) 
are the classes of an equivalence relation. The proof, analogous to the proof 
of Proposition 4.49 (page 83), is relegated to Exercise 16. 


PROPOSITION 10.14 (Orbits as equivalence classes.) 
Let (X, G,) be a permutation representation of a group G ona set X. Define 


a relation *% on X by 


gS y ifand only if y=A,(«) for some g in G. 


(a) The relation “2, is an equivalence relation on X. 


(b) For x in X, the equivalence class of x is the orbit AGg(x) of x. 


Example 10.15 (Conjugacy.) 
Let x be an element of a group G. Then the orbit t¢(a) in the conjugacy 
action (G,G,r) of G is the conjugacy class 


{g-x-g"'|g in G} 


of x in G. The equivalence relation 7% on G given by Proposition 10.14 is 
known as conjugacy. In the group GL(2, F’), the conjugacy relation is known 
as matrix similarity. 


10.3 Transitive actions 


Consider a permutation representation (X,G,) of a group G on a set X. 
According to Proposition 10.14, the set X is partitioned by the orbits. The 
representation is said to be transitive if there is just one orbit. Equivalently, 
the action (X, G, ) is transitive if X is nonempty, and for each pair of elements 
x, y of X, there is at least one element g of G with y = \,(z). 
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Example 10.16 (Regular action.) 
For a group G, the regular representation is transitive. See Corollary 4.54 
(page 86) and Exercise 42 in Chapter 4. 


Example 10.17 (Matrix action.) 

Let F be a field. Consider the left multiplication action (10.3) of GL(2, F) 
on F?. This action is never transitive, since the zero column vector is always 
alone in its orbit. However, there is only one other orbit (Exercise 17). ] 


Example 10.18 (Restriction to an orbit.) 
Let (X,G,.) be a permutation representation. For an element x of X, let Y 
be the orbit Ag(«) of . Then the restriction (Y,G, A) to Y is transitive. J 


The most important example of a transitive action is given by the following 
proposition. 


PROPOSITION 10.19 (Action on cosets.) 
Let H be a subgroup of a group G. Consider the set 
G/H={«xH |x in G} (10.8) 
of left cosets of H. Then a transitive action of G on G/H is defined by setting 
Ag(eH) = gxH (10.9) 


for each g in G. 


PROOF First, verify the conditions (a) and (b) of Proposition 10.5. 
(a): For the identity element e of G, (10.9) gives 


Ae(wH) = exH = 2H 


for each element x of G. 


(b): For elements g and h of G, (10.9) gives 
Agn(@H) = (gh)cH = g(hxH) = Ag (An(xH)) 
for each element x of G. 


Now consider elements x and y of G, with corresponding elements 7H and 
yH of G/H. Then for g = y- x7! in G, we have 


A, (cH) =9- cH =y-x'-csH=yH, 


so the action of G on G/H is transitive. (] 
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DEFINITION 10.20 (Homogeneous spaces.) Let H be a subgroup of 
a group G. 


(a) The set G/H of (10.8) is called a homogeneous space. 


(b) The action (G/H,G, A) of Proposition 10.19 is called the homogeneous 
space action of G on G/H. This action is often denoted simply by G/H. 


Example 10.21 
Let G be a group, with identity element e. 


(a) The homogeneous space G/G is trivial. 


(b) For the trivial subgroup {e} of G, the homogeneous space G/{e} is 
essentially the left regular representation of G, elements x of G just 
being rewritten as singleton sets {x}. 


U 


Up to renaming, such as x +> {x} in Example 10.21(b), every transitive 
action is a homogeneous space. 


THEOREM 10.22 (Transitive actions are homogeneous spaces.) 

Let 4: G — X! be a transitive action of a group G on a nonempty set X. 
For each element a of X, consider the stabilizer Ga of a under the action 
(X,G,p). Let (G/Ga, G, r) be the homogeneous space action of G on G/Ga. 


(a) There is a well-defined bijection 
b: G/Gqg > X;h- Ga pn(a). 
(b) For elements g and h of G, 


b(Ag(h- Ga)) = bg (b(h: Ga) - 


PROOF (a): The map 0 is a well-defined injection, since 


h-Gag=k-Ga © k‘h liesin G, 
<=  pe-an(a)=a > ptn(@) = pee (a) 


for elements h and k of G. The map b surjects, since (X, G, js) is transitive. 


(b): We have 
b(Ag(h : Ga)) = b(gh . Ga) _ Hgh(a) _ Lg (tn (a)) = [tg (d(h : G.)) 


for elements g and h of G. l] 
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Theorem 10.22 reduces the analysis of any transitive action (X,G, 1) of a 
group G to computations entirely within the group itself. Pick a fixed element 
a of X, and use the bijection b of Theorem 10.22(a) to label the elements 
Ltn(a) of X by left cosets h- G of the stabilizer G,. Then, to compute the 
effect of the action 4g on an element labeled h - G,, Theorem 10.22(b) shows 
that it suffices to compute the effect of the corresponding homogeneous space 
action A, on the coset h-G. With A,(h- Ga) as the coset gh- Ga, we obtain 
lig(b(h: Ga)) as the element labeled by gh- Ga. The procedure is illustrated 
in Figure 10.2. 


Homo- gh: Ga h-Go Go 
geneous hy 
space -—  - = 
G/Ga 
Labeling Labeling 
b b 
Transitive 


FIGURE 10.2: Tracking a transitive action in a homogeneous space. 


Theorem 10.22 has a corollary that is very useful for counting. 


COROLLARY 10.23 (Orbit sizes divide the group order.) 


Let (X,G, 1) be a permutation representation of a finite group G. Then for 
each element a of X, the orbit uc(a) is finite, with size |uc(a)| = |G|/|G,| 
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equal to the index of the stabilizer Ga. In particular, the size of each orbit is 
a divisor of the order of G. 


PROOF Apply Theorem 10.22(a) to the transitive action (j1¢(a), G, 1) 
obtained by restriction of (X,G,) to the orbit uwc(a) of a. The bijection b 
establishes an isomorphism between the orbit and the set of left cosets of the 
stabilizer G, in G. 


Example 10.24 (Matrix action.) 

Let F be a finite field, with g elements. Consider the left multiplication 
action (10.3) of GL(2, F) on Fi, with orbits as described in Example 10.17. 
The set Fj of two-dimensional column vectors has q? elements, so the orbit 
of nonzero vectors has q? — 1 elements. Corollary 10.23 then shows that the 
size of GL(2, F) is a multiple of q? — 1. In fact, each invertible matrix 


A- ke | 


21 422 
over the field F' is constructed from two column vectors 
a, = | and ag= | 3 
a2] a22 
In setting up the matrix A, the first vector a; may be chosen as any of the 
q? —1 nonzero vectors in F}. In order to guarantee that det A 4 0, the second 


vector a) must then avoid the q scalar multiples la, with | in F. There are 
g? — q such choices for az. In total, there are 


(@ —1)(@? — 4) = (a+ La(q— 1)? 


elements in the group GL(2, F). l] 


10.4 Fixed points 


Let (X,G,.) be a permutation representation of a group (G,-,e) on a set 
X. Let x be an element of X, and let g be an element of G. Then x is said 
to be a fixed point of g (under X) if 


dA, (x) = 2. (10.10) 


Note that x is a fixed point of g if and only if g lies in the stabilizer Gz of x. 
By Proposition 10.5, each element of X is a fixed point of e. 
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Example 10.25 (Conjugation action.) 

Let G be a group. Consider the conjugation action (G,G,r) of G on itself 
— compare (10.7). Then elements x and y of G commute if and only if x is 
a fixed point of y under 7: 


xz-y=y-a if and only if rH yeay TS, (al 


Each element of G is a fixed point of a central element z. In other words, the 
center Z(G) is the set of elements z commuting with each element x of G. [ 


Example 10.26 (Regular action.) 

Let (G,-,e) be a group. Consider the regular representation (G,G, A) of G 
on itself — compare Example 10.2. As observed above, each element of G is 
a fixed point of e under ». For elements x and g of G, the element x is a 
fixed point of g under A if and only if g-« = x, which happens if and only 
if g =e. Thus a nonidentity element of G has no fixed points in the regular 
representation. 


Example 10.27 (Plane rotations.) 

Consider the rotational representation (R?,R, «) of the additive group (R, +, 0) 
of real numbers from Example 10.11. Note that the origin O is a fixed point 
of every real number 6. However, a real number @ that is not a multiple of 27 
has no other fixed points. 


Now let (X,G, ) be a permutation representation of a finite group (G,-, e) 
on a finite set X. For each element g of G, let 7)(g) or a(g) denote the 
number of fixed points of g under A. According to Example 10.26, 


mg) = i ee (10.11) 


0 otherwise 


in the regular representation of a finite group (G,-,e) of order n. 
The following result relates the number of orbits in the representation to 
the number of points that are fixed by each element of G. 


THEOREM 10.28 (Burnside’s Lemma.) 
Let G be a group of finite order |G|. In a permutation representation (X, G, A) 
of G on a finite set X, the total number of orbits is equal to the average number 


a S> (9) (10.12) 


of fixed points of an element of G. In particular, the sum 


S> x(9) 
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of the number of fixed points (g) of each element g of G is a multiple of |G|. 


PROOF Consider the subset 
F ={(g,0) in Gx X| g(a) = 2} 


of G x X. The number of elements in the finite set F’ will be computed in 
two different ways. Equating the two answers will then prove the theorem. 
First, consider the pairs (g,x) in F' for a particular element g of G. Each 
such pair comprises a fixed point x of g in its second slot, so there are 7(g) 
such pairs altogether. Summing over all the group elements g, we obtain 


IF| = do x9) (10.13) 
gEG 


as the first expression for |F'. 
Now let Orb(\) be the set of orbits in the action (X,G,). The set F may 
be described as the disjoint union 


Be Gat 
LEX 


of products of stabilizers with singletons containing points of X. With an 
application of Corollary 10.23, we obtain 


|F| = $2 |Ge| = 9° IG|/|Ae(2)| = |G] 92 1/lAc(2)| 


cEX rEX rex 

=Iel dD doii¥l=lel DD 1 

Y €Orb(A) EY Y €Orb(A) 
=|GJ-|Orb(a)]. (10.14) 


Equating the two quantities (10.13) and (10.14) for |F'| yields the desired 
expression (10.12) for the number |Orb(A)| of orbits. 


Example 10.29 (Regular action.) 
Consider the regular representation of a finite group G with n elements. 
Using (10.11), the expression (10.12) for the number of orbits becomes 


*(n+(n—1)-0) =1, 


which of course is the correct count for a transitive representation. (] 


Example 10.30 (Matrix action.) 
Let F' be the 2-element field Z/2. Consider the left multiplication action 
(F},GL(2, F),L) from Example 10.4. As discussed in Example 10.17, the 
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Blaiietiig/of || Geb sb Gxed poitts | Numbecaty) 
GL(2, F) stgaiel | ek bxearpeinte 
fa Fi 4 

Pol tbl El 
Ha} | tbl 
a} | tbl td 
ho} | tb 
ee 


FIGURE 10.3: Fixed points in the action of GL(2, F) on F}, for F = Z/2. 


number of orbits in this representation is 2. The various elements of GL(2, F), 
with their fixed point sets, are listed in Figure 10.3. 
The expression (10.12) for the number of orbits becomes 


5(443-242-1), 


which reduces correctly to 2. In fact, this count may be used as proof that 
the set of nonzero column vectors forms a single orbit. 


10.5 Faithful actions 
Consider the rotational representation 
KR (R35) 06 Keo (10.15) 


of Example 10.11. Recall that for a real number 6, the action kg rotates 
the plane R4 counterclockwise by an angle of 6 radians. Thus if distinct real 
numbers @ and ¢ differ by a multiple of 27, their actions kg and Ky coincide. 
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DEFINITION 10.31 (Faithful action.) A permutation action 
A:G—> XI;gh rg 


of a group G is said to be faithful if the homomorphism X is injective. 


Example 10.32 (Plane rotations.) 
The rotational representation (10.15) is not faithful. (] 


Example 10.33 (Regular actions.) 
By Cayley’s Theorem, the regular representation of a group G is faithful. (] 


Faithful permutation actions are very close to groups of permutations. 


PROPOSITION 10.34 
Let (X,G, A) be a faithful permutation action of a group G ona set X. Then 
the abstract group G is isomorphic to a group of permutations of the set X. 


PROOF The injective group homomorphism \ : G — X!;g + Ag yields 
an isomorphism 
G— XG):gr Ag 


of G with the group \(G) of permutations of the set X. (] 


In the converse direction, let G be a group of permutations on a set X. 
Let 7: Go X!;g + g be the inclusion of G as a subgroup of X!. Then the 
permutation action (X,G, 7) of G is faithful. The following theorem provides 
a more general source of faithful actions. 


THEOREM 10.35 (Every action induces a faithful action.) 

Let (X,G, A) be a permutation representation of a group G. Let Ker X be the 
group kernel of the homomorphism X: G— X!. For each element g of G, let 
g denote the coset (Ker A)g. Let G denote the quotient group G/Ker X. Then 
there is a faithful action (X,G,) given by a well-defined homomorphism 


A:G—> XhgrdA,. 


PROOF Apply the First Isomorphism Theorem for groups to the group 
homomorphism 
A:G>XI;gr rg, 
factorizing \ as the composite j obo s of the projection G > G;g + g, the 
isomorphism b : G — A(G);g — Aj, and the insertion j : \((G) — X!. Then 
X, as the composite bo s, is injective. J 


GROUP ACTIONS 267 


COROLLARY 10.36 
The kernel of the representation X is the intersection 
Ker \ = al G, 
LEX 
of the stabilizers Gy, of the elements x of X. 


PROOF An element g of G lies in the kernel Ker 4 if and only if 
Ag(a) =x 


for each « in X. This happens if and only if g lies in G, for each x in X. l] 


DEFINITION 10.37 Let (X,G,X) be a permutation representation of a 
group G. Then the faithful representation (X,G, ) of Theorem 10.35 is called 
the faithful representation (or faithful action) induced by (X,G, A). 


Example 10.38 
The rotational representation (R3,R, K) of Example 10.11 induces a faithful 


permutation representation (R},R/27Z,%) of the group (R/27Z, +). ] 


Example 10.39 
Let G be a group. The conjugacy action (G,G,7) of G induces a faithful 
action (G,Inn(G),7) of the inner automorphism group Inn(G) of G. 


10.6 Cores 


Suppose that (X,G,A) is an action of a group (G,-,e). Let x and y be 
elements of X that lie in the same orbit, say Ag(a) = y for some element g of 
the group G. Consider an element s of the stabilizer G, of x in G, so 


As(a) = x. 


Now 
Ng-sg-H(Y) = Ag OAs oAg-1(Y) = Ag OAs(a) = Ag(t) = y, 
so that g-s-g7' lies in the stabilizer G of y. In other words, when \,(x) = y, 
the inner automorphism 7, yields a map 
Tg: Gz > Gy;stog-s-g™' 
from the stabilizer of x to the stabilizer of y. This map is a bijection, with 


two-sided inverse Tg-1- 
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Two subgroups S and T of a group G are said to be conjugate if there is 
an inner automorphism 7, of G with 7,(S) = 7’. We obtain the following: 


PROPOSITION 10.40 (Conjugacy of stabilizers.) 
Let (X,G,.) be a permutation representation of a group G. 


(a) If elements x and y of X lie in the same orbit, then their respective 
stabilizers Gy, and Gy are conjugate in G. 


(b) The conjugate T,(Gz) of the stabilizer G, of an element x of X is the 
stabilizer of the element q(x) of X. 


(c) If the action (X,G,A) is transitive, then the stabilizers G, of elements 
of X form a full set of conjugate subgroups of G. 


Example 10.41 
Consider the action of S3 on {0,1,2}. The respective stabilizers of the points 
0, 1, and 2 are the subgroups {(0), (1 2)}, {(0), (2 0)}, and {(0),(0 1)}. Then 


T(0 1.2): (1 2) (20) (01) (1 2) 
— compare Exercise 38 in Chapter 2. (] 


Proposition 10.40 has an important application to homogeneous spaces. 


DEFINITION 10.42 (The core of a subgroup.) Let H be a subgroup 
of a group G. Then the core of H in G is the intersection 


Coreg(H) = (| 74(H) 
gEG 


of the full set of conjugates of H inG. 
THEOREM 10.43 (Stabilizers in homogeneous spaces.) 
Let H be a subgroup of a group (G,-,e). Let (G/H, G, A) be the corresponding 
homogeneous space, with actions 
Ag: G/H — G/H;«-H- gt-H 
for elements g of G. 
(a) The stabilizer of H in G is H. 


(b) For each element g of G, the stabilizer of \,(H) = 9-H in G is T,(H). 
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(c) The group kernel of the representation 
\:G— (G/H)! 


is the core of H in G. In particular, the core of H is a normal subgroup 


of G. 


(d) If N is a normal subgroup of G that is contained in H, then N is also 
a subgroup of the core of H. 


PROOF (a): An element g of G lies in the stabilizer of H if and only if 
H =4,(H) = 9-4, so if and only if g lies in H itself. 
(b): Apply (a) and Proposition 10.40(b). 
(c): Apply (b) and Corollary 10.36. 
(d): Ifa normal subgroup N is contained in H, then N = 7,(N) is contained 
in each conjugate T,(H) of H. l] 


Parts (c) and (d) of Theorem 10.43 are often summarized by saying that 
the core of H is the largest normal subgroup of G that is contained in H. 
Theorem 10.43 has a useful corollary. 


COROLLARY 10.44 (The index of a core.) 
Let G be a group, with a subgroup H of finite index r. 


(a) The index of Coreg(H) in G is finite. 
(b) The index of Coreg(H) in G is a divisor of r!. 
(c) The index of Coreg(H) in G is a multiple of r. 


PROOF (a): By Theorem 10.43(c) and Theorem 10.35, the homogeneous 
space representation (G/H,G.) induces a faithful representation 


(G/H, G/Coreg(H), X) 


of the quotient group G/Coreg(H). Now Proposition 10.34 implies that the 
group G/Coreg(H) is isomorphic to a subgroup S of the finite group (G/H)! 
of order r!. 

(b): By Lagrange’s Theorem, the order of S is a divisor of r!. 

(c): By Lagrange’s Theorem, |G/Coreg(H)| = r- |H/Coreg(H)|. l] 


Example 10.45 (Subgroups of index 2.) 

Let G be a group, and let H be a subgroup of index 2. Corollary 10.44 shows 
that the proper subgroup Coreg(#) of G has index 2, and therefore coincides 
with H. In other words, H is a normal subgroup of G. (Exercise 28 asks for 
a direct proof that a subgroup of index 2 is normal.) 
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10.7 Alternating groups 
Let n be a positive integer, and let 
In > +++ >%1 > Xo 


be a set of n+ 1 distinct real numbers. For a in S,4, = {0,1,. 
the sign e(a) of the permutation a to be the quotient 


i>i La(j) — Vali a(j) — %ali 
e(a) = Lnzirizoltacs) ~ acy) _ TT cen tee., 


Hnzj>izo(es ~ #1) io. 2 
For example, if a is the permutation (0 1 2) in $3, then 


_ {vo = £2)(#o — £1) (%2 — #1) 
ON aaa ee) 


If a is the permutation (0 1) in S3, then 


ale 


_ (@2 = 0) (@2 = 1) (0 = 1) _ 
OF ae Sa ae 


..,n}!, define 


(10.16) 


(10.17) 


The numerator of the left-hand fraction in (10.16) includes all the factors from 
the denominator, either with the same or reversed order. Thus the sign of a 


permutation is either 1 or —1. 


PROPOSITION 10.46 
Let n be a positive integer. Then the sign map 


E: Sn41 2 {£l};a% (a) 


(10.18) 


is a group homomorphism from Sn41 onto the group of units {+1} of the 


monoid (Z,-,1) of integers. 


PROOF For permutations a and ( in S;,41, we have 


LaoB(j) ~ YaoBli 
e(ao 3) = II B(3) B(t) 


Li Xj 
n>j>i>0 J g 


Le 3) — La 
n>joiro ~PG) "8 ns5Sa>0 


x 
n>j>i>0 ° 


= e(aje(9). 
By (10.17), <(0 1) = —1, so ¢ is surjective. 


n>j>ir0 


Va(j) — Lali) UB(j) — TBC) 
Il Li Xi Il Li Uy 


Ta0B(j) — FaoB(i) UB(j) — YB(i) 
Li Xi 
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DEFINITION 10.47 Let n be a positive integer. 


(a) The group kernel Kere of the sign homomorphism (10.18) is known as 
the alternating group An4+1. 


(b) A permutation is said to be even if its sign is 1. 


(c) A permutation is said to be odd if its sign is —1. 


Recall that |S;,41| = (n+1)!. By the First Isomorphism Theorem for groups 
applied to (10.18), it is apparent that 


1 
[Angi] = s(n +0)! 


for each positive integer n. 

Although the definition (10.16) of the sign is ideally suited to the proof of 
Proposition 10.46, it is not very practical for computation. If x and y are two 
distinct elements of a set X, then the cycle 


(x y) 


that interchanges x and y, while fixing all the other elements of X, is called 
a transposition (as an element of the permutation group X!). Transpositions 
in S;,41 are odd: 


PROPOSITION 10.48 (Transpositions are odd.) 
For a positive integer n, let o be a transposition (k 1) in Sn41. Then o is an 
odd permutation. 


PROOF Suppose that | > k. In order to determine <(7), we examine the 
factor 
Vo(j) ~ Voi) 


Li Uy 


(10.19) 


from the right-hand side of (10.16) for each pair j > i of integers between 
n and 0. If the set {j,i} has no element in common with {l,k}, then the 
numerator and denominator of (10.19) are equal, canceling to 1. For the 
pairs 7 >l and j > k, the product 


(@o(j) — Foy) (oj) — Vo(e)) _ (Hj — Le) (4j — 21) 


(a4 — a1)(a3 — Ze) (xj — 21)(x3 — ZK) 


cancels to 1. Similarly, for the pairs | > 7 and k > i, the product 


(%o(t) — Lo(s))(Lo(k) — Lo(i)) _ (ye — 2) (1 — 2%) 


(x, — %i)(Zp — Za) (x1 — xi)(Zk — Xi) 
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cancels to 1. For | > j and j > k, the product 


(Lo) — o(g))(Lo(j) — Lo(ky) _ (We — @y) (ay — a1) 


(ai. = 27 (aj — Sy) (x, — %3)(x3 — ZK) 


reduces to 1. Finally, for the pair | > k, the quotient 


Lo(l) ~ Vo(k) — Tk — XI 
tT] — Lk T1 — Lk 


reduces to —1. Overall, e(a) = —1. l] 


COROLLARY 10.49 
A permutation is even or odd if and only if it can be expressed as a product 
of an even or odd number of transpositions respectively. 


PROOF Suppose that a permutation a is the product of t transpositions. 
Then Proposition 10.48 and Proposition 10.46 show that e(a) =(—1)'. I 


Using Corollary 10.49, Proposition 10.46, and the fact that permutations 
in S;,41 decompose as products of disjoint cycles, it is possible to determine 
the sign of a given permutation more directly. Noting the equality 


(ao a1 ag ... Ap) = (Ao Ay) O-+- 0 (Aq a2) 0 (ao a1) (10.20) 
for distinct elements ag, a1,...,@, of n with r+1 <n (Exercise 31), we obtain 
E(ap a, ... ay) =(-1)": 


cycles of odd length are even, while cycles of even length are odd. If a general 
permutation a is the product of h cycles of respective lengths 11,...,l,, the 
sign €(a@) is then given as the power 


(ater 


of —l. 


Example 10.50 

Consider the symmetries of the regular tetrahedron that are discussed in 
Study Project 2, Chapter 2. The 8 cycles of length 3 that appear in (2.33) are 
even, as are the 4 elements of the Klein 4-group V4 (Example 2.31, page 38). 
Together, these 12 even permutations form the full alternating group A4. 
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10.8 Sylow Theorems 


In the final section of this chapter, we use group actions to relate the order 
of a finite group with its structure. 


PROPOSITION 10.51 

Let p be a prime number. For positive integers a and m, let G be a group of 
finite order p’m. (The prime p may be a divisor of m.) Let ng be the number 
of subgroups of G of order p*. Then 


Ng =1 mod p. (10.21) 


In particular, G does have a subgroup of order p*. 


PROOF Let X, be the set of all the p*-element subsets of G, subgroups 
or not. For each p*-element subset S of G, define 


a,(S)=9-S 


for g in G. Since the conditions of Proposition 10.5 are satisfied, we obtain an 
action (Xq,G,o) of Gon X,. For a p*-element subset S$ of G, Corollary 10.23 
shows that the orbit 7¢(S) contains |G|/|Gs| elements, G's being the stabilizer 
of S in the action. Now Gs -S = S implies that S is a union of right cosets 
of the subgroup Gg, so |Gs| = p? with b < a. 

Let x be an element of G. If s is an element of S, then x lies in o,,-1(S). 
In other words, for each p*-element subset S of G, the subsets in the orbit 
oG(S$) cover all of G. 


e If no two subsets in the orbit o¢(S') overlap, then there are exactly m 
subsets in the orbit. Orbits of this kind are called nonoverlapping. 
This case obtains precisely when G'g is a subgroup of G of size p*. 
In particular, Gg is the unique member of the orbit that contains the 
identity element 1 of G. Thus the number of nonoverlapping orbits is 
Na, the same as the number of subgroups of size p*. 


e Otherwise, the size p*m/p? of the orbit aG(.S) exceeds m, and thus is 
some multiple of pm. Orbits of this kind are described as overlapping. 


The sum of all the orbit sizes, the total number of p*-element subsets of a 


p*m-element set, is 
pem 
10.22 
Ce) ana 
(compare Exercise 43 in Chapter 6). Let & be the number of p*-element 
subsets that lie in overlapping orbits. The size of each overlapping orbit is 
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congruent to 0 modulo pm, sok =0 mod pm. There are ng nonoverlapping 
orbits, each containing m subsets. Thus 


c IP) = nam keg mod pm. (10.23) 
p 
If G is the cyclic group Z/p%mZ, then there is a unique subgroup (m+ p*mZ) 
of size p%, so n. = 1 in this case. Thus the binomial coefficient (10.22) is 
congruent to m modulo pm. 

Now return to the general group G of order p*m. The congruence (10.23) 
becomes 

n= ie :) =n: M+k=nag:m mod pm. 
pe 

Since pm divides ng +m —m = (na —1)-m, we have p| (mq —1). The desired 
congruence (10.21) follows. 


Let x be an element of a finite group G. Lagrange’s Theorem shows that 
the order |(x)| of x is a divisor of |G]. Conversely, for a divisor d of |G], there 
may be no element x of order d. For example, the symmetric group S3 has 
no element of order 6. The first consequence of Proposition 10.51 shows that 
for prime divisors, the situation is different. 


COROLLARY 10.52 (Cauchy’s Theorem.) 
If p is a prime divisor of the order of a finite group G, then G contains an 
element of order p. 


PROOF Taking a = 1 in Proposition 10.51 shows that G has a subgroup 
HAT of order p. Each nonidentity element of H has order p. l] 


We now study subgroups of maximal prime-power order. 


DEFINITION 10.53 (Sylow subgroups.) Let G be a nontrivial finite 
group of order n. Suppose that n factorizes as a product 


E1 ,€2 


N= pips’ ..-p," 


of powers of distinct prime factors p,,p2,.--,Dr- 


(a) For each such factor p;, a subgroup of G of order p;‘ is called a Sylow 
pi-subgroup of G. 


(b) The set of all Sylow p;-subgroups of G is written as Syl,,(G). 
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THEOREM 10.54 (Sylow’s First Theorem.) 
Let G be a finite group, and let p be a prime factor of the order of G. Then 


ISyl,(G)| =1 mod p. 


In particular, G has at least one Sylow p-subgroup. 
PROOF § In Proposition 10.51, take the case where m is coprime to p. (] 


Example 10.55 

Consider the alternating group Ay, of order 12 = 2? -3 (see Example 10.50). 
The Klein 4-group is the unique Sylow 2-subgroup. The 8 cycles of length 3 
that appear in (2.33) pair up, together with the identity element, to form 4 
Sylow 3-subgroups. Note that 4=1 mod 3. 


THEOREM 10.56 (Sylow’s Second Theorem.) 
Let G be a finite group, and let p be a prime factor of the order of G. 


(a) All the Sylow p-subgroups of G are conjugate. 


(b) If |H| = p* for a subgroup H of G, and some positive integer a, then H 
is contained in a Sylow p-subgroup of G. 


PROOF Let P be a particular Sylow psubgroup of G. To prove (b), 
we will show that H is contained in some conjugate 7,(P) of P. Statement 
(a) follows by taking H to be a Sylow p-subgroup of G. In this case, since 
|H| = |P| = |t,(P)|, the containment of H in 7,(P) implies the equality of H 
and T,(P). 

To prove (b) by contradiction, suppose that H is not contained in any 
conjugate 7,(P) of P. Then the intersection 


HO 7,(P) ={g in G|g in H and g in 7,(P)} 


is a proper subgroup of H, so the quotient |H|/|H  T(P)| is divisible by p 
for each element g of G. 
For h in H and p in P, define 


Bap (x) =h-a-p* 


for x in G. Since 
Bek,q) (Bcn,p) (x)) = k(hap~')q7' = (kh)x(qp)—* = Bk.q) (hyp) (x) 


for k in H and q in Q, Proposition 10.5 shows that we obtain an action 
(G,H x P,G) of Hx PonG. 
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The orbit By, p(1) of the identity element 1 of G is the subset H - P of G. 
The stabilizer of 1 is 


{(h,p) in Hx P| hp} =1}={(h,hk*) | hin HNP}. 
Thus 
|H- P| =|A|-|P|/|HnP|. (10.24) 
The orbit Gx p(g) of an element g of G is the set HgP (a double coset in the 
notation of Exercise 39 in Chapter 4). Apply Proposition 4.47, and (10.24) 
with 7,(P) in place of P. The orbit 6, p(g) is seen to have size 
% |H| -|7o(P)| 
|HgP| = |HgP9 "| = |H -19(P) = a 
: |H 9 T4(P)| 


a multiple of p|P|. Since |G| is the sum of the sizes of the orbits, we obtain 
the contradiction that |G] is a multiple of p|PI. 


COROLLARY 10.57 
Let G be a finite group, and let p be a prime factor of the order of G. If P 
is a unique Sylow p-subgroup, then P is a normal subgroup of G. 


PROOF For each element g of G, consider the conjugate 7,(P). As a 
Sylow p-subgroup of G, it coincides with P. Thus P is normal. l] 


Sylow’s Theorems may be used to obtain strong limitations on the number 
of Sylow subgroups. 


PROPOSITION 10.58 
Let G be a finite group, and let p be a prime factor of the order of G. Then 


|Syl,(G)| =1 mod p (10.25) 


and 


[Syl,(G)| | |Gl. (10.26) 


PROOF The congruence (10.25) is just a restatement of Sylow’s First 
Theorem. The divisibility relationship (10.26) follows by Corollary 10.23, 
since G acts transitively on Syl,(G) by conjugation. l 


Example 10.59 

Let G be a group of order 33. Since 1 is the only divisor of 33 = 3-11 
which is congruent to 1 modulo 11, Proposition 10.58 shows that there is a 
unique Sylow 11-subgroup of G. By Corollary 10.57, this subgroup is normal. 
A similar argument shows that there is a unique Sylow 3-subgroup, again a 
normal subgroup. 
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10.9 Exercises 
1. Let (G,-,e) be a group. Define a map 
pg:GoGjrnu-g! 
for each element g of G. Show that there is a permutation representation 


p:GoGlhgr pg. (10.27) 


The representation (10.27) is called the right regular representation of 
the group G. 


2. Let G be a group. For elements g and h of G, show that 
Ag 2 Ph = Pro Ag 
in the notation of Example 10.2 and Exercise 1. 
3. Let G be a group. Show that 
Tg = Ag ° Pg 
for each element g of G. 
4. Let (G,-,e) be a group. If 
Ago pr(e) =e 
for elements g and h of G, show that g = h. 


5. Let (X,G,A) and (Y,G, ) be actions of a group G. 


(a) Show that an action 
(X x Y,G,n) 


is defined by 
T(x, y) = (Ag(2), Hg(y)) 


for g in G, « in X, and y in Y. [The action 
(X x Y,G,n) 


is known as the product of the actions (X,G,A) and (Y,G, y).] 
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(b) If the sets X and Y are disjoint, show that an action 
(X UY,G,c) 


is defined by 


(2) Ag(z) for z in X 
o,(z) = 
7 Vg(z) for zin Y 


for each g in G. [The action 
(X UY,G,c) 
is known as the sum of the disjoint actions (X, G, A) and (Y,G;, y).] 
6. Show that the center of a group is abelian. 
7. Show that a group G is abelian if and only if G = Z(G). 


8. For an abelian group G, show that the conjugation action is the trivial 
representation of the group G on the set G. 


9. Let N be a subgroup of the center of a group G. 


(a) Show that N is a normal subgroup of G. 
(b) If the quotient group G/N is cyclic, show that G is abelian. 


10. Consider the symmetric group 53. 


(a) Show that the center of $3 is trivial. 
(b) Show that Ss & Inn($3). 


11. Show that the center of the general linear group GL(2, F’) over a field 


F is the set 
x0 ; 


of nonzero multiples of the identity matrix. 
12. Show that there is an isomorphism G/Z(G) © Inn(G) for each group G. 


13. Let G be a group. 


(a) Using the notation of Example 10.2 and Exercise 1, show that the 
map 
T:Gx G— Gl; (g,h) Ag ° pr 
is a group homomorphism. 


(b) Determine the kernel of the homomorphism T. 


14. 


15. 


16. 
17. 


18. 


19. 


20. 
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For each normal subgroup N of a group (G,-,e), define 
N={(g,n-g)|g in G,n in N}. 


(a) Show that N is a subgroup of G x G. 


(b) Show that a normal subgroup N of G is contained in Z(G) if and 
only if {e} is a normal subgroup of N. 


In Example 10.11, verify that «& gives an action of the additive group 
(R,+,0) of real numbers on the plane R$. (Compare Exercise 34 in 
Chapter 4.) 


Prove Proposition 10.14. 
For a field F’, consider the action (10.3) of GL(2, F) on Fi. 


[7] 


in F}, show that there is an invertible matrix 


(a) For each nonzero vector 


A= be a] 
a21 422 
in GL(2, F) such that 
1 
La 4 =X 


(b) Conclude that the set of nonzero 2-dimensional column vectors 
forms a single orbit in the action (10.3). 


Let H be a subgroup of a group G. Let (X,G,A) be an action of G, 
with corresponding restriction (X,H,A) to the subgroup H. Let x be 
an element of X. 


(a) Show that the orbit Aq() is a subset of the orbit AG(x). 

(b) Show that the orbit Ag¢(«) is a disjoint union of orbits Aq(y). 
Consider a permutation a = (#1 @2 ... Lp—-1 Z,) of a finite set X. Show 
that a permutation y of X is conjugate to a in the symmetric group X! 


if and only if y = (y1 yo .-- Yr—1 yr) for distinct elements y1,...,y, of 
X. (See Exercise 38 in Chapter 2.) 


Determine the conjugacy class of the matrix 


i 


in the group GL(2, Z/2) of invertible 2 x 2 matrices over Z/9. 
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21. 


22. 


23. 


24. 


25. 


26. 
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For a field F’, define the trace function 


. p2 {411 412 
tr: Fy ~ F 


> Q11 + G22 
p) 
| 


— compare (5.17). Let A and B be invertible 2 x 2 matrices over F’. 


(a) Show that tr(AB) = tr(BA). 


(b) If A and B are conjugate in the group GL(2, F’), show that tr(A) = 
tr(B). 


(c) Give an example to show that tr(A) = tr(B) does not imply the 
conjugacy of A and B. 

Let (M,-,e) be a monoid, with group of units M*. For u in M*, define 

Au: Ma M;2-u-m. (10.28) 


(a) Show that (M, M*, A) is an action of M* on M. 

(b) Show that M is a group if and only if (M, M*, d) is transitive. 
Consider the permutation representation (Z,Z*,A) given by (10.28) for 
the monoid (Z,-,1) of integers under multiplication. Show that the 


orbits are the classes of the kernel relation ker sq of the squaring function 
(2.3). 


Consider the permutation representation (Z/,(Z/s)*, A) given by (10.28) 
for the monoid (Z/gs,-,1) of integers modulo 8 under multiplication. 
(a) Determine the orbits. 


(b) Determine the number z(w) of fixed points for each element wu of 
(Z/s)*. 
(c) Verify (10.12) for this action. 


Repeat Exercise 24 for the monoid (Z/g9,-,1) of integers modulo 9 under 
multiplication. 


Let G be a finite group with just two conjugacy classes. Suppose that 
the order of G is n. 


(a) Show that all the centralizers of nonidentity elements of G are 
conjugate. 


(b) Show that the centralizer of a nonidentity element has order 
n/(n—1). 


(c) Conclude that G has order 2. 


27. 


28. 


29. 


30. 
3l. 


32. 


33. 


34. 


35. 


36. 
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Let G be a group of order 35, and let H be a subgroup of order 7. Show 
that H is a normal subgroup of G. 


Let G be a group, and let H be a subgroup of index 2 in G. Give 
a direct proof that H is a normal subgroup of G (without invoking 
Corollary 10.44). 


Let G be a group of finite order n. Suppose that n factorizes into a 
product 


N= pips «1p, 
of powers of prime numbers pj,..., px, with py < po <--- < pr. If A is 
a subgroup of G of index p,, show that H is a normal subgroup of G. 


Show that a permutation a is odd if and only if its inverse a~+ is odd. 


Verify (10.20) by showing that both sides have the same effect on each 
element 7 of n. 


Determine the signs of each of the following permutations: 
(a) (0 1 2) 0 (3 4) 0 (56) ; 

(b) (012345)0(6789); 

(c) (012345)0(2467). 


Show that the identity permutation cannot be expressed as a product 
01, 9020°'::00p 


of an odd number h of (not necessarily distinct) transpositions oj, c2, 
+5 On. 


A nonidentity element x of a group (G,-,e) is an involution if 
CSC) 


For a positive integer n, show that each involution in the symmetric 
group S,+1 is a product of disjoint transpositions. 


Let G be a group in which each nonidentity element is an involution. 
Show that G is abelian. 


Determine the conjugacy class of the 3-cycle (0 1 2) in each of the 
following groups: 

(a) The symmetric group $3; 

(b) The symmetric group S4; 

(c) The alternating group Aq. 
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37. Consider an integer n > 1. 


(a) Show that each transposition in the symmetric group S, may be 
expressed as a product of transpositions from the set 


Tn ={(kK-1k)|0<k<n}. 
(b) Show that each element of the symmetric group S,, is a product of 


transpositions from the set T), . 


(c) Write each element of the symmetric group $3 as a minimal product 
of elements of the set 


Ts; = {(0 1), (1 2)}. 


(d) Express the element (0 2) of the symmetric group 53 in two different 
ways as a minimal product of elements of the set 


Ts = {(0 1), (1 2)}. 
38. Let S be a subset of a finite group (G,-,e). 


(a) Show that if 
2|S| > |G, (10.29) 
then S-S=G. 
(b) Show that the strictness of the inequality in (10.29) is essential. 
39. Show that the Klein 4-group V4 is a normal subgroup of the alternating 
group Ag. 
40. Determine the Sylow subgroups of the symmetric group 53. 


41. Let G be a group of order 15. Show that the Sylow subgroups of G are 
normal. 


42. Consider the set 
xt 
o={ [5] 


of invertible upper triangular matrices over the ring Z/3 of integers 
modulo 3. 


x,y,z in Z/s, vy 40h 


(a) Show that G forms a group under matrix multiplication. 

(b) Show that |G] = 12. 

(c) Show that the diagonal matrices in the set G form a subgroup D 
of order 4. 

(d) Is D a normal subgroup of G? 


(e) Determine whether the group G is or is not isomorphic to the 
alternating group A. (Hint: Consider Sylow 2-subgroups.) 
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10.10 Study projects 


1. Even and odd functions. Consider the set C(R) of all continuous 
functions f : R — R. For such a function, define 


Tf: R—-R «cr f(-2). 
Define f to be even if Tf = f, and odd if Tf = —f (see Figure 10.4). 


(i) fg (ii) th 


FIGURE 10.4: (i) An even function g. (ii) An odd function h. 


(a) Show that the assignments (0 1) +> T and (0) + idea) give an 
action of the symmetric group Sj = {(0), (0 1)} on C(R). 

(b) For each natural number n, show that the power function x” is 
even if n is even, and odd if n is odd. 

(c) Show that each continuous function f is the sum g +h of an even 
function g and an odd function h. 

(d) Suppose that a continuous function f is both even and odd. Show 
that f is identically zero. 

(e) If f : R > R is a continuous odd function, show that 


q f(a)dx =0 


for each positive real number r. 

(f) If f : R > R is a continuous odd function, what is the value of 
f(0)? Justify your answer. 

(g) If f : R — R is a differentiable even function, what is the value of 
f'(0)? Justify your answer. 
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2. The projective line. Let F be a field. In the plane Fy, each line 


through the origin is the set 


Xh = { | | hyx, + hoxe = of 


of solutions 


to the matrix equation h?x = 0 or 
[Ai he] ia =0 
Bp) 


for a nonzero row vector [ha ha], the transpose h? of a column vector 


The vertical line (or x-axis) is X{, 9). Each nonvertical line, of slope 
m, is Xfm —1]. Note that Xy = X, if and only if h = ck for a nonzero 
constant cin F’. The set of all the lines Xy, through the origin is called 
the projective line or the 1-dimensional projective space PG(1, F’) over 
the field F’. In this context, a line Xp through the origin in the plane 
F} is called a point on the projective line PG(1, F). 


(a) Consider the action (F},GL(2, F), L) of Example 10.4, with 
La(x) = Ax 
for a matrix A in GL(2, F) and a column vector x in F}. For each 
point Xy on the projective line PG(1, F’), show that 
Aa(Xn) = {La(x) |x in Xp} (10.30) 
is again a point on the projective line. 
Show that Aa(Xn) = X(A-1)Th- 
(c) Show that there is an action 
(PG(1, F), GL(2, F), A) 
of GL(2, F) on PG(1, F) with A4 defined by (10.30). 
(d) Show that the action (PG(1, F), GL(2, F), A) is transitive. 
(ec) Show that a line Xy through the origin in F} is a fixed point of 
an invertible matrix A in the action (PG(1, F),GL(2, F), A) if and 
only if the nonzero members of Xy are eigenvectors of A. 


(f) For the 2-element field F' = Z/2, determine the set of fixed points 
in PG(1, F’) for each element A of GL(2, F). 


(g) Show that GL(2,Z/2) & Ss. 


yee 
& 
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3. The Class Equation. Let (G,-,e) be a group of finite order n, with 

conjugacy classes 
Ch = {e},Co,...,C5. 

Applying Proposition 10.40(b) to the conjugacy action (G, G,7T) of G, it 
is apparent that for 1 <i <_s, the various elements x of the class C; all 
have conjugate centralizers Cg(x). Let k; denote the common order of 
the centralizers Cg(x) of the elements x of C;. In particular, note that 
ky =n. 


(a) Show that |C;| = n/k; for 1 <i<s. 
(b) Obtain the Class Equation 


“on 
=1 — 10.31 
n et 3 (10.31) 
or its equivalent form 
“1 
1= —. 10.32 


— 
lo) 
oF 


Suppose that a finite group has prime power order p°. By reading 
the Class Equation (10.31) modulo p, show that C; cannot be the 
only conjugacy class containing just one element. Conclude that a 
group of prime power order has a nontrivial center. 


If p is a prime number, show that a group of order p? is abelian. 


* a 
o 
ee 


For a prime number p, consider the set 


luz 
=| Oly 
001 


L,Y, in zip} 


of 3 x 3 matrices over Z/,. Show that under matrix multiplication, 
U, forms a nonabelian group of order p?. Determine Z(U,). 
Prove the following statement by induction on the positive integer 
s: For each positive real number 7, there are only finitely many 
(and maybe no) decreasing sequences 


— 
a 


ky > ko >-++ > ks 


of s positive integers k; such that 


a1 
=e, = (10.33) 
i=1 


(Hint: If (10.33) holds, then 0 < ks < s/r.] 

By considering the Class Equation in its alternative form (10.32), 
show that there are only finitely many possible orders n for a finite 
group with a fixed number s of conjugacy classes. (Hint: k, = n.) 


FORE 
ithe} 
NS 
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4. Simple groups. A nonabelian group is said to be simple if it has no 
proper, nontrivial normal subgroups. Suppose that a group G has a 
normal subgroup N. Then many properties of G (such as the order) 
may be recovered from the smaller groups G/N and N. This kind of 
decomposition continues until we reach simple groups or abelian groups. 
Thus determination of the simple groups is an important basic step 
towards the study of general groups. 


Consider the group As, of order 60. 


Show that As contains 20 permutations of the form (a 6 c), all of 
which are conjugate in As. 


Show that As contains 15 permutations of the form (a b)(c d), all 
of which are conjugate in As. 


Show that A; contains 24 permutations of the form (a bc d e). 


Show that the 24 permutations of the form (a 6 c d e) break up 
into two conjugacy classes in As, each of size 12. 


Show that the conjugacy classes of As have respective sizes 
1,20, 15,12,12. (10.34) 


Show that no sum of numbers from the list (10.34), except for 1 
and 60, is a divisor of 60. 

Given that a normal subgroup N of As is a union of conjugacy 
classes, with |N| | 60, show that As is simple. 


10.11 


Section 


Notes 
10.1 


In a permutation representation (X,G, ), the stabilizer G, of a point x of 
X is also known as the isotropy subgroup or inertial subgroup. 


Section 


10.4 


W. Burnside was an English mathematician who lived from 1852 to 1927. 


Section 


10.8 


A.L. Cauchy was a French mathematician who lived from 1789 to 1857. 
L. Sylow (or Sylov) was a Norwegian mathematician who lived from 1832 to 
1918. (His name is pronounced “seal off,” not “sigh low.” ) 


Chapter 11 


QUASIGROUPS 


The multiplication in a group (G,-) satisfies two key properties. Along with 
associativity, there is the property discussed in Corollary 4.54 (page 86): If 
the equation 


LEsy=zZ 


holds in G, then knowledge of any two of the elements x, y, z of G specifies 
the third uniquely. In particular, the latter property implies that the body 
of the multiplication table of a finite group is a Latin square (Theorem 4.55, 
page 86). For many purposes, the combination of these two group properties 
is too strong, and it becomes necessary to consider sets that are closed under 
a multiplication satisfying just one of the two properties. Sets closed under an 
associative multiplication are semigroups. This chapter studies sets that are 
closed under a multiplication satisfying the second property. Such structures 
are known as quasigroups. 


11.1 Quasigroups 


DEFINITION 11.1 (Quasigroups.) Let a given set Q be closed under a 
multiplication x-y or xy of its elements x, y. Suppose that when the equation 


Ly=zZz 


holds for elements x, y, z of Q, then knowledge of any two of x, y, z specifies 
the third uniquely. In this case, the structure (Q,-) or Q is said to be a 
quasigroup. 


By Corollary 4.54 (page 86), each group (G,-) forms a quasigroup. The 
empty set also forms a quasigroup: There are no elements for which the 
closure and equation-solving conditions of Definition 11.1 have to be checked. 
This quasigroup is associative, since it contains no counterexamples to the 
associative law. The following examples give nonassociative quasigroups. 
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Example 11.2 (Subtraction modulo 4.) 
Consider the set Z/4 of integers modulo 4, with the operation of subtraction. 
Consider the equation 

L-Yy=z (11.1) 


between elements x, y, z of Z/4. If x and y are given, then (11.1) specifies z 
uniquely. If (11.1) holds, and y, z are given, then x is specified uniquely as 
x=yt+z. If (11.1) holds, and 2, z are given, then y is specified uniquely as 
y=a-—z. Thus (Z/4,—) is a quasigroup. Since 


(1-1)-1=341=1-(1-1), 
the associative law is not satisfied: The quasigroup is not a group. (] 


Arguing as in Example 11.2, it may be shown that the set of integers forms 
a nonassociative quasigroup under the operation of subtraction (Exercise 1). 


Example 11.3 (Arithmetic means.) 
Consider the set R of real numbers. The arithmetic mean of two real numbers 
x and y is 
xc+y 
5 


LOY = 


Geometrically, the arithmetic mean represents the midpoint of the real line 
segment from x to y (Figure 11.1). Note that coy = you, so o gives 
a commutative multiplication on the set R of real numbers. Consider the 
equation 


Loy=z (11.2) 


between real numbers x, y, and z. Certainly the arithmetic mean z is uniquely 
specified by x and y. If y and z are given, then x is uniquely specified as 
x = 2z—y. Similarly, y is uniquely specified by (11.2) in terms of x and z. 
Thus (R, 0) is a quasigroup. Since 


(004)08=208=643=006=00 (408), 


the associative law is not satisfied: The quasigroup is not a group. (] 


FIGURE 11.1: The arithmetic mean x 0 y of real numbers « and y. 
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Example 11.4 (Nonzero bit strings.) 

Fix a natural number r. The set Ca al of bit strings of length r+ 1 
carries the componentwise abelian group structure inherited from the group 
(Z/2,+) of integers modulo 2 (Example 4.35, page 78). The zero element of 


the additive group ((Z/2)"*, +) is the string 000...00 of r+1 zeroes. Let P, 


denote the subset of (Z / a consisting of nonzero bit strings. The set P, has 
2"*1 _ 1 elements, which may be considered as the binary expansions of the 
positive integers less than 2"+!. It is often convenient to write the elements 
of P, as such integers (using base ten expansions), rather than writing long 
bit strings. Thus P, becomes 


{01,10,11} or {1,2,3}, 
while P, becomes 
{001,010,011,100,101,110,111} or {1,2,3,4,5,6,7}. 


Now define a multiplication *« on P, by 


ee 
veya" a (11.3) 


This means that in the equation 
L*KY =z 
for elements x, y, z of P,, either x = y = z or 
r+ty+z=0 


in the group (Zia) A). At any rate, the bit string quasigroup (P,,*) 
is a quasigroup for each natural number r, and is not a group for r > 0 
(Exercise 4). ll 


11.2 Latin squares 


By definition, quasigroups satisfy the property (Corollary 4.54, page 86) 
ensuring that the body of a finite group multiplication table is a Latin square. 
Thus the body of the multiplication table of a finite (nonempty) quasigroup 
will also be a Latin square. For illustration, the multiplication table of the 
quasigroup (Z/4, —) of Example 11.2 is shown in Figure 11.2. Note that each 
of the four integers modulo 4 appears exactly once in each row and each 
column of the body of the table. 
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- 0 1 2 3 
0 0 3 2 #1 
1 1 0 3 2 
2 2 1 0 3 
3 3 2 1 O 


FIGURE 11.2: Subtraction modulo 4. 


In Figure 11.2, it is possible to change the column labels, making the table 
that results present the addition operation for the group of integers modulo 4 
(Exercise 6). In general, however, there may be no way to label a Latin square 
so that it forms the body of a group multplication table. (In Section 11.7, it 
will be shown that the smallest such Latin square has size 5 x 5.) But as the 
following theorem indicates, a Latin square always gives a quasigroup table. 


THEOREM 11.5 (Latin squares as quasigroup tables.) 
A finite, nonempty set Q with a multiplication - forms a quasigroup (Q,-) if 
and only if the body of the multiplication table of (Q,-) forms a Latin square. 


PROOF If (Q,-) is a finite, nonempty quasigroup, then the argument 
showing that the body of the multiplication table of (Q,-) forms a Latin square 
is exactly the same as the argument used to prove Theorem 4.55 (page 86) — 
compare Exercise 7. 

Conversely, suppose that @ has n elements, say 21,...,%p in acertain order. 
Consider the multiplication table of (Q,-), with the rows and columns each 
labeled by the elements x1,...,%» in order. Suppose that the body of the table 
forms a Latin square. Note that for 1 <i,7 <n, the table entry appearing in 
the row labeled x; and the column labeled 2; is x;-x; (Figure 11.3). Suppose 
that the equation 

LOCH LE (11.4) 


holds in (Q,-), with 1 <i,j,k <n. If x; and a; are given, then x; is specified 
uniquely in (11.4) by the multiplication - defined on Q. Now suppose that 2; 
and x, are given in Q, so that (11.4) holds. Consider the table row labeled 
by the element x;. Since the body of the table is a Latin square, this row 
contains the element x; exactly once. Let x; be the label of the column in 
which this table entry x, appears. Then 2; is the unique solution of (11.4) 
for the given x; and x,. In similar fashion, it may be shown that (11.4) 
has a unique solution «; for given x; and x, (Exercise 8). Thus (Q,-) is a 
quasigroup. 
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Xi 


Li UG 


FIGURE 11.3: A multiplication table. 
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Theorem 11.5 provides a direct way to construct finite quasigroups. We 
first build a Latin square, and then make it into a quasigroup multiplication 
table by providing it with row and column labels taken from the set of entries 
of the square (Figure 11.4). Although it is customary and convenient to use 
these labels in a specified order (for example, increasing numerical order as 
in Figure 11.4), the labels may be applied in any order to give a quasigroup 


multiplication table. 


D)opy Rl ml; w yr 
me} Od] Ooulr | do] 
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FIGURE 11.4: 
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From a Latin square to a quasigroup multiplication table. 
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There are various approaches to the construction of Latin squares. We 
may certainly take the body of any finite, nonempty group or quasigroup 
multiplication table. Another approach is to build up the Latin square by 
gradually adding entries until the square is complete. At each stage, we must 
make sure that no entries are repeated in any row or column of the partial 
table. At the same time, we may take advantage of the fact that each element 
has to appear somewhere in each row and column. Sometimes, this procedure 
will stall: There will be no way to complete the partial table to a Latin square. 
If this happens, we have to withdraw a step or two, and then try again. 

As an example, consider the problem of building up a 4 x 4 Latin square 
using the set {0,1,2,3} of integers modulo 4. Suppose that we have reached 
the following partial square, which seems to be acceptable since there are no 
repeated elements in any row or column: 


The elements 2 and 3 have to appear somewhere in the first row. The element 
2 cannot appear in the second column of the first row, since 2 already appears 
in the second row of the second column. This forces 3 to be the second entry 
in the first row, leaving 2 to appear in the final entry of the first row: 


0/3)1]2 


Now we encounter the problem that the element 2 is appearing twice in the 
final column. To avoid this problem, we may go back to the original square 
and withdraw the element 2 from the third row of the final column: 


Arguing as before, we reach the square 
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0/3)1] 2 


which may then be completed to a full Latin square, say 


0/3)1]2 
1|2}3 0 
2/1/01] 3 
3/0)2]1 


for instance. 


11.3. Division 


In semigroups, such as the multiplicative structures of fields and rings, 
division is not always possible. For example, we cannot divide by the element 
2 within the ring Z of integers, nor by 0 in the field R of real numbers. On 
the other hand, quasigroups are defined so that division is always possible. In 
fact, there are two forms of division in a quasigroup: from the left, and from 
the right. 


DEFINITION 11.6 (Quasigroup divisions.) Let (Q,-) be a quasigroup. 
Consider elements x and y of Q. 


(a) The element x\y of Q is defined as the unique solution z of the equation 
x: z=y. In other words, 


a-(x\y)=y. (11.5) 


The element x\y may be read as “x dividing y” or “x backslash y.” The 
operation \ on the set Q is known as left division in the quasigroup 


(Qs); 


(b) The element x/y of Q is defined as the unique solution z of the equation 
z-y=a. In other words, 


(x/y)-y=o. (11.6) 


The element x/y may be read as “xc divided by y” or “x slash y.” The 
operation / on the set Q is known as right division in the quasigroup 


(Q,:). 
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Example 11.7 (Subtraction as a right division.) 
Consider the abelian group (Z,+) of integers as a quasigroup, in which the 
“quasigroup multiplication” is given by addition. Equation (11.6) becomes 


(x/y)+y=«. 


Thus the right division z/y in the quasigroup (Z,+) is just the subtraction 
x —y. More generally, subtraction is right division in any additive group. 


Example 11.8 (Division in bit string quasigroups.) 
In the bit string quasigroups (P,,*) of Example 11.4, 


ux(a*xy)=y and (a*y)*y=a2 (11.7) 


(Exercise 12). Thus in this case, z\y = x * y = x/y: The two division 
operations are the same as the quasigroup multiplication. 


Example 11.9 (Divisions in groups.) 
Let (G,-) be a group, considered as a quasigroup. Then 


z\y=a'y and z/y=ay' (11.8) 


(Exercise 13). It is worth recalling the role of the right division in a group, 
as the single operation that is used in the subgroup test (Proposition 4.43, 
page 80; compare Remark 4.44). 


Example 11.10 (Reflection as a quasigroup division.) 

Consider the arithmetic mean quasigroup structure (R,o) on the real line, 
as given in Example 11.3. In solving (11.2) for x in terms of y and z, it was 
shown there that 

z/y=2z-y. 
This operation of right division in the arithmetic mean quasigroup has a 
geometrical interpretation, as the reflection of y in a mirror located at z 
(Figure 11.5). (] 


0 ‘z/y Zz y 


FIGURE 11.5: The reflection z/y of y in a mirror at z. 
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PROPOSITION 11.11 (Properties of divisions.) 
Let (Q,-) be a quasigroup. Consider elements x and y of Q. 


(a) a\(x@-y)=y. (c) (y-a)/a=y. 


(b) w= y/(x\y) . (dq) a= (y/x)\y . 


PROOF Suppose «-y = z. Recall that since (Q,-) is a quasigroup, the 
element y is the unique solution t in Q to the equation 


z-t=2Z. (11.9) 


On the other hand, (11.5) — with z written in place of y — shows that «\z 
is also a solution ¢ to (11.9). Thus x\z = y, as required for (a). Moreover, 
since x - (a\z) = z, the equation x = z/(x\z) follows by (11.6). Replacing z 
by y yields (b). For (c) and (d), which are similar, see Exercise 14. J 


Proposition 11.11 yields a new characterization of quasigroups, often more 
convenient than Definition 11.1. (The standard labeling of the equations in 
the following proposition is justified in Study Project 3.) 


PROPOSITION 11.12 (Characterization of quasigroups.) 
A set Q forms a quasigroup (Q,-) under a multiplication - if and only if it is 
equipped with a left division \ and a right division / such that 


(SL) x-(e\y)=y. (SR) (2/y)-y=o. 


(IL) a«\(@-y)=y. (IR) (y-a)/~=y. 


for all x, y inQ. 


PROOF If (Q,-) is a quasigroup, then (SL) and (SR) are the respective 
defining equations (11.5) and (11.6) for the left and right divisions, while 
Proposition 11.11(a) and (c) provide (IL) and (IR). 

Conversely, suppose that Q is equipped with operations -, \, and / satisfying 
the identities (SL), (IL), (SR), and (IR) of the proposition. Consider the 
equation 

Liy=z (11.10) 


for elements x, y, z of Q. If y and z are fixed, the equation (SR) gives 
(z/y)-y = z, so that x = z/y is a solution to (11.10). If s and ¢ are solutions, 
(IR) gives s = (s-y)/y = z/y = (t-y)/y =t, so the solution is unique. Similar 
use of (SL) and (IL) shows that (11.10) has a unique solution for y when x 
and z are fixed (Exercise 15). Thus (Q,-) is a quasigroup. 
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One further consequence of Proposition 11.11 is that the left and right 
divisions provide new quasigroup multiplications. 


THEOREM 11.13 (Divisions as quasigroup multiplications.) 
Let (Q,-) be a quasigroup, with left division \ and right division /. Then 
(Q,\) and (Q,/) are quasigroups. 


PROOF Consider the equation 
a\y =z (11.11) 


involving elements x, y, z of Q. If x and y are given, then z is determined 
directly and uniquely by (11.11). Now suppose that y and z are given. By 
Proposition 11.11(d), the equation (11.11) has a solution x = y/z. But if 
t\y = z as well as 2\y = 2, then Proposition 11.11(b) gives 


c=y/z=y/(t\y) =¢t, 


so the solution is unique. Similar arguments show that (11.11) has a unique 
solution for y in terms of x and z, so that (Q,\) becomes a quasigroup. Again, 
similar arguments show that (Q, /) is a quasigroup (Exercise 16). 


Example 11.14 (Subtraction modulo 4, revisited.) 

Theorem 11.13 gives an immediate proof for the content of Example 11.2, 
showing that the set Z/4 of integers modulo 4 forms a quasigroup under 
subtraction. As noted in Example 11.7, subtraction is the right division for 
the addition in any additive group like (Z/4,+). 


Another way to obtain new quasigroups is by reversing given quasigroup 
multiplications. Suppose that a multiplication - is given on a set Q. Then the 
opposite multiplication o is defined by 


TOY=Yaz@ 
for elements x and y of Q. The following result is readily checked (Exercise 18). 


PROPOSITION 11.15 
If (Q,-) is a quasigroup, then so is its opposite (Q,°). 


DEFINITION 11.16 (Conjugates of a quasigroup.) Let (Q,-) be a 
quasigroup. Then the conjugates of (Q,-) are the quasigroups (Q,-) itself, its 
opposite (Q,0°), the quasigroups (Q,\) and (Q,/) of Theorem 11.13, and their 
respective opposites (Q, \\) and (Q, //). 
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11.4 Quasigroup homomorphisms 


Just like rings, groups, and semigroups, quasigroups are abstract algebras 
in their own right. As such, they come equipped with their corresponding 
concepts of substructure, homomorphism, and product. 


DEFINITION 11.17 (Subquasigroups.) Let (Q,-) be a quasigroup. 
Then a subquasigroup S of (Q,-) is a subset of Q which forms a quasigroup 
(S,-) under the multiplication - of (Q,-). 


Example 11.18 (Subquasigroups of bit string quasigroups.) 
The bit string quasigroup P, of Example 11.4, taken on the subset 


{1,2,3} = {001, 010, 011} 
of the underlying set 
{1,2,3,4,5,6,7} = {001, 010,011, 100, 101, 110, 111} 
of the quasigroup (P2,*), forms a subquasigroup of (P2,*). The various 


3-element subquasigroups of (P2,*) are displayed by the straight lines and 
curved line in Figure 11.6. 


2 = 010 


1 = 001 5 = 101 4 = 100 


FIGURE 11.6: Subquasigroups of the bit string quasigroup Pp». 
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Subquasigroups of a quasigroup are characterized by closure under the three 
operations of multiplication, left division, and right division. 


PROPOSITION 11.19 (The subquasigroup test.) 
Let S be a subset of a quasigroup (Q,-). Then S is a subquasigroup of (Q,-) 
if and only if 

x-y, «\y, and a«/y liein S (11.12) 


for alla, yin S. 


PROOF § If S is closed under the three operations of multiplication, left 
division, and right division, then Proposition 11.12 shows that S forms a 
quasigroup under the multiplication of (Q,-). Conversely, suppose that (S,-) 
is a quasigroup. In particular, x-y lies in S for x, y in S. Moreover, for given 
y and z in S, the unique solution x to 


Liy=z (11.13) 


in S must agree with the unique solution x to (11.13) in Q. This solution 
is z/y. Thus S is closed under the right division. A similar argument shows 
that S is also closed under the left division (Exercise 23). 


In a quasigroup (Q,-), closure of a subset S under the multiplication alone 
is generally insufficient to make S a subquasigroup. For example, the set 
N of natural numbers is closed under the addition operation in the group 
(Z, +) of integers, but N does not form a subquasigroup there. For a natural 
“nonassociative” example, see Exercise 24. 


DEFINITION 11.20 (Quasigroup homomorphism, isomorphism.) 
Suppose that (P,*) and (Q,°) are quasigroups. 


(a) A function f : P — Q is a quasigroup homomorphism, denoted by 


f:(P,*) > (Q,0°), if 
f(x) 0 f(y) = fla *y) (11.14) 
for all x, y in P. 


(b) If a quasigroup homomorphism f : (P,*) > (Q,0) is bijective, it is 
called an isomorphism. 


(c) Quasigroups P and Q are isomorphic (written P ~ Q) if there is an 
isomorphism between them. 
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Example 11.21 (Inclusion of a subquasigroup.) 
Let P be a subquasigroup of a quasigroup (Q,-). Then the inclusion function 


J: PCQrKH 2 
is a quasigroup homomorphism j : (P,-) > (Q,-). l] 
The following result is an analogue of Proposition 5.5 (page 96). 
PROPOSITION 11.22 (Homomorphisms respect division.) 


Let f : (P,-) — (Q,-) be a quasigroup homomorphism, between quasigroups 
with \ and / as the respective divisions. Then 


f(@)\F(y) = F(a\y) and f(x)/f(y) = fla/y) 
for x and y in P. 
PROOF Since z- (x\y) = y in (P,-), the fact that f is a quasigroup 
homomorphism implies 
f(x): F(a\y) = Fy) 


in Q. However, the unique solution z to the equation f(x)- z= f(y) in Q 


is z = f(x)\f(y). Thus f(x)\f(y) = f(x\y), as required. The proof that 
f:P— Q preserves right divisions is similar (Exercise 26). (] 


PROPOSITION 11.23 (Componentwise quasigroup structure.) 
Let (P,-) and (Q,-) be quasigroups. Then the product set P x Q, equipped 
with the componentwise multiplication 


(x1, 2) * (Yi, Yo) = (1-1, 2° Ye), 
forms a quasigroup (P x Q,-). In this quasigroup, the left division 
(1, ©2)\(y1, y2) = (@1\y1, 22\ya) 


and right division 


(x1, ©2)/(y1, y2) = (@1/y1, F2/y2) 


are given componentwise in terms of the left and right divisions on the factors 


PandQ. 


PROOF Consider the equation 


(x1, @2) - (yi, Y2) = (41, 22) 
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in P x Q. For given (yi, y2) and (21, 22) in P x Q, ie., for given yi, 21 in P 
and yo, z2 in Q, there is a unique solution 


(1, €2) = (21/41, 22/y2) 
in P x Q. The other verifications are similar (Exercise 29). (] 
DEFINITION 11.24 (Product quasigroups.) For quasigroups (P,-) 


and (Q,-), the quasigroup (P x Q,-) of Proposition 11.23 is known as the 
product of the quasigroups (P,-) and (Q,°:). 


es 


FIGURE 11.7: Subquasigroups of the product quasigroup P, x P,. 


Example 11.25 (A product of bit string quasigroups.) 

Consider the bit string quasigroup P, = {1,2,3}. The product quasigroup 
P, x P, is displayed in Figure 11.7. Note that ordered pairs such as (3, 1) 
are written simply as juxtaposed digits like 31. The straight and curved lines 
denote the 3-element subquasigroups, e.g., {11, 23,32} curving round the top 
left-hand corner. Compare with Figure 11.6. 
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11.5 Quasigroup homotopies 


The previous section discussed various algebraic concepts associated with 
quasigroups: substructures, homomorphisms, and products. These concepts 
are very similar to the analogous concepts for semigroups, groups, rings, and 
the other kinds of algebra encountered in this book. For quasigroups, however, 
it turns out that the concepts of homomorphism and isomorphism do not tell 
the full story. Consider the quasigroup (Z/3,—) of integers modulo 3 under 
subtraction. Its full multiplication table is presented in Figure 11.8. 


FIGURE 11.8: Subtraction modulo 3. 


The quasigroup is not associative, for example 


(0-2) -1=1-1=042=0-1=0-(2-1). 


Now consider the bijective function 
g: 2/3 3 2/334 —« (11.15) 


of negation modulo 3. Applying this permutation to the column labels in 
Figure 11.8 yields the addition table for the group (Z/3, +) of integers modulo 
3, displayed (with an unconventional ordering of the columns) in Figure 11.9. 


FIGURE 11.9: Addition modulo 3. 
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Note that the bodies of the multiplication tables in Figures 11.8 and 11.9 
consist of the same 3 x 3 Latin square, even though the second quasigroup is 
associative, while the first is not. Since the associativity property is preserved 
by quasigroup isomorphisms, it is apparent that the same Latin square has 
furnished multiplication tables for quasigroups which are not isomorphic. 

Phenomena such as these are captured by the following definition, which 
may be contrasted with Definition 11.20. 


DEFINITION 11.26 (Quasigroup homotopy, isotopy.) Suppose that 
(P, x) and (Q,°) are quasigroups. 


(a) A triple (f,g,h) of functions f: P>Q,9:P—>Q, andh: PQ is 
a quasigroup homotopy, denoted by 


(f,9,h) : (P,*) > (Q,9), 


if 
f(x) 0 gly) = h(a *y) (11.16) 
for all x, y in P. 


(b) The functions f, g, h of (a) appearing in the homotopy (f,g,h) are 
known as the components of the homotopy. 


(c) If the components of a quasigroup homotopy 
(f,9,h) : (P,*) > (Q,9) 
are bijective, the homotopy is described as an isotopy. 


(d) Quasigroups P and Q are said to be isotopic (written P ~ Q) if there 
is an isotopy between them. 


Example 11.27 (Integers modulo 3.) 

Consider the set Z/3 of integers modulo 3. Define functions f and h to be 
the identity function on the set Z/3. With g : Z/3 — Z/3 as the negation 
(11.15) modulo 3, we have 


f(x) + gy) = h(x — y) 


for integers x, y modulo 3. Thus 


(f,.9,h) : (2/3, —) > (Z/s, +) (11.17) 


is an isotopy. l] 


The distinction between homotopy and homomorphism may be seen on 
comparing (11.16) with (11.14). In particular: 
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PROPOSITION 11.28 (When homotopies are homomorphisms.) 
Suppose that (P,*) and (Q,°) are quasigroups. 


(a) A quasigroup homotopy (f,g,h) : (P,*) > (Q,0°) is a homomorphism if 
its three components agree: f=g=h. 


(b) Each quasigroup homomorphism f : (P,*) > (Q,°) forms a homotopy 
(F,f,f) : (P,*) > (Q,9). 


Composites of homotopies are homotopies. 


PROPOSITION 11.29 (Composites of homotopies.) 
Suppose that (N,-), (P,*), and (Q,°) are quasigroups, with homotopies 
(f,9,h) : (F, *) — (Q,°) and F.g8) : (N,-) —? (P, *). Then 


(fof',gog',hoh’):(N,-) > (Q,9) 


is again a quasigroup homomorphism. 


PROOF By respective use of the homotopy property (11.16) of (f,g,h) 
and (f’,g',h’), we have 


(fo f')(2) o(geg')(y) = F(F(2)) ogg) 


for xz, yin N. l] 


COROLLARY 11.30 (Isotopy as an equivalence relation.) 
Isotopy forms an equivalence relation on any set of quasigroups. 


PROOF By Proposition 11.28(b), the identity map idg on a quasigroup Q 
forms an isotopy. Thus the relation of isotopy is reflexive. Now suppose that 
N~ Pand P ~ Q for quasigroups N, P, and Q. By Proposition 11.29, the 
composite of respective isotopies from N to P and from P to Q is an isotopy 
from N to Q, so the relation of isotopy is transitive. Finally, suppose that 
(f,9,h) : (P,*) — (Q,°) is an isotopy. Consider elements x and y in Q, say 
with f(z’) = x and g(y’) = y for unique elements x’, y’ of P. By (11.16), we 
have 
h(a! «y') = f(a’) ogy’) =xoy. 

—l(2) * g-'(y), as required to show that 
is an isotopy. It follows that the relation 


4 
=p 
S 
mn 
Pa 
a 
— 
8 
O° 
= 
II 
=e 
* 
—< 
Key 


of isotopy is symmetric. l] 
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11.6 Principal isotopy 
In order to simplify the concept of isotopy as much as possible, the following 


definition is useful. 
DEFINITION 11.31 (Principal isotopy.) 
(a) A quasigroup isotopy 
(f,9,h) : (P,*) > (Q,°) 
between quasigroups (P,*) and (Q,0°) is said to be a principal isotopy if 


its third component h is the identity map idp : P — P on the set P (and 
thus in particular, if the domain set P and codomain set Q coincide). 


(a) Two quasigroup structures (Q,*) and (Q,°) on a common underlying 
set Q are said to be principally isotopic if there is a principal isotopy 


(f,g,idg) : (Q,*) > (Q,9). 


Example 11.32 
The isotopy (11.17) of Example 11.27 is a principal isotopy. l] 


To within isomorphism, every isotopy is principal: 


PROPOSITION 11.33 (Factorizing an isotopy.) 
Consider a quasigroup isotopy (f,g,h) : (P,*) > (Q,°). 
h: PQ to induce a multiplication 


roy=h'(h(z) oh(y)) 


Use the bijection 


for xz, y in P. 
(a) The structure (P,0) is a quasigroup. 
(b) There is an isomorphism h : (P,°o) > (Q,0°). 
(c) The isotopy (f,g,h) factorizes as the composite 
(f,9,h) =(h,h,h) 0 (h7* 0 f,h7 0g, idp) 


of the principal isotopy (h~1 0 f,h~1 0 g,idp) : (P,*) = (P,0) together 
with the isomorphism h : (P,o) > (Q,°). 
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Verification of the straightforward details in Proposition 11.33 is assigned as 
Exercise 39. The composite isotopy in Proposition 11.33(c) may be expressed 
symbolically as (P, x) ~ (P,0) = (Q,0°). 

Principal isotopy clarifies the relationship between the various quasigroups 
obtained with a given Latin square as the body of their multiplication table. 


THEOREM 11.34 (Bordering a Latin square.) 

Let Q be a finite set. Then two quasigroups (Q,*) and (Q,-) share a Latin 
square L(Q) built on Q as the common body of their multiplication tables if 
and only if they are related by a principal isotopy (f,g,idg) : (Q,*) > (Q,-). 


PROOF Suppose that @ has n elements 21,...,2%p. First, suppose that 
(Q,*) and (Q,-) share a Latin square L(Q) built on Q as the common body 
of their multiplication tables. In other words, there are permutations r’, c’, 
r, and c of the set Q such that 


r(Zn) 


is a multiplication table of (Q,*) and 


(an) 


is a multiplication table of (Q,-). Then for €, 7 in Q, we have 


r'(€)-¢'(m) = r(€) *e(m). 
Substituting € = r~!(x) and 7 = c7(y), we obtain 


r(r-*(a)) (ey) = ay (11.18) 


for z, y in Q. Define new permutations f = r’or7! and g = coc! of Q. 


The equation (11.18) becomes f(x) - g(y) = x * y for x, y in Q, yielding the 
principal isotopy (f,g,idg) : (Q,*) — (Q,-). 
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Conversely, suppose there is a principal isotopy (f,g,idg) : (Q, *) — (Q,°:). 
Thus 
f(x) -9y) =a¥y (11.19) 


for elements x, y of Q. Let L(Q) be the Latin square on Q which forms the 
body of the multiplication table 


In 


of (Q,*). Then by (11.19), the multiplication table of (Q,-) is 


Thus (Q,*) and (Q,-) share the Latin square L(Q) as the common body of 
their multiplication tables. l] 


11.7 Loops 


Semigroups with an identity element are monoids. Quasigroups with an 
identity element are called loops. 


DEFINITION 11.35 (Loops, identity element.) <A quasigroup (Q,-) 
is said to be a loop if it contains an element e such that 


€-LHUM=2E 


for all elements x of Q. The element e of Q is called the identity element of 
the loop (Q,-,e). 
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Groups are certainly loops. Although it is not easy to find natural examples 
of nonassociative loops, each Latin square is the body of a multiplication table 
of a loop. 


PROPOSITION 11.36 (Latin squares are loop tables.) 

Let Q be a finite, nonempty set. Let L(Q) be a Latin square built from the 
elements of Q. Then L(Q) is the body of the multiplication table of a loop 
(Q,:,e) on the underlying set Q. 


PROOF Suppose the Latin square is 


U11 %12... Vin 

U1 QQ... Lan 
LQ)=]. . eis 

Enl inQ.--- Unn 


so that the n-element set Q is given as 


Q= APG Dots nc nay = a May he Diy bx 


Then the bordered version 


T11 Tin 

U1 T11 Tin 
L(Q) 

Trl Trl 7" Tnn 


of the Latin square L(Q) is the multiplication table of a loop (Q,-,211) on 
the underlying set Q, with x11 as the identity element. l] 


Example 11.37 (Subtraction and addition modulo 3.) 

Take Q to be the set of integers modulo 3, and take L(Q) to be the body of 
the table of (Z/3,—), as illustrated in Figure 11.8. Then the addition table 
modulo 3, as displayed in Figure 11.9, exhibits the construction of the proof 
of Proposition 11.36. 


Proposition 11.36 shows that each finite, nonempty quasigroup is principally 
isotopic to a loop (Exercise 40). However, there is a more direct and general 
argument. 
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THEOREM 11.38 (Quasigroups are isotopic to loops.) 
Let (Q,-) be a nonempty quasigroup, with left division \ and right division /. 
Let a and b be elements of Q. Define a new multiplication o on the set Q by 


roy = («/b) - (a\y) (11.20) 


for x, y inQ. Then (Q,0,a- b) is a loop that is principally isotopic to the 
quasigroup (Q,-). 


PROOF By Proposition 11.11(a) and (11.6), we have 
xo (a-b) = (a/b) - (a\(a-b)) = (a/b) -b=2 

for x in Q. Similarly, by Proposition 11.11(c) and (11.5), we have 
(a-b) oe = ((a-b)/b) -(a\z) =a: (a\z) =2 


for z in Q. Thus (Q,0,a- b) is a loop. 
Now define 

a:Q-Qyra-y 
and 

B:Q—-Q;r4H2-b. 
The maps a and £ are bijective, with corresponding inverses 

a": Q>Q:yra\y 
and 

BO Ore a/b 
(Exercise 41). By (11.20), the triple 

(B-',a~*,idg) : (Q,°) as (Q, -) 


is an isotopy. Thus 
(8, a, idg) : (Q, -) =a (Q,°) 
is the required principal isotopy from (Q,-) to the loop (Q,0,a- b). l] 


It is natural to ask why the concept of isotopy does not arise in the study 
of groups. The following theorem and its corollary provide an answer. 


THEOREM 11.39 (Loop isotopes of groups are groups.) 
If a loop is isotopic to a group, then it is isomorphic to that group. 


PROOF It suffices to consider the case of a principal isotopy 
(f, g; idg) : tO; *, ex) a (Q, °, €o) 
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from a loop structure (Q, *,e.) on a set Q to a group structure (Q,0,e.) on 
Q. Thus 


f(x) ogy) =a¥y (11.21) 
for elements x, y of Q. Setting y = e, in (11.21) yields 
f(x) ° gles) = Te, = 2, 


so that 
f(x) = xog(e.)* 


in the group (Q,°, eo). Similarly, setting x = e, in (11.21) yields 
flex) 0 g(y) = ex FY =Y, 


so that 
gly) = flex) boy 


in (Q,°,e.). Equation (11.21) may now be rewritten in the form 
©0g(ex)~'0 flex) oy = (a*y) 


within the group (Q,0,e.). Multiplying from the left by f(e,.)~', and from 
the right by g(e.)~1, we obtain 


Flex)" ovo gles)" 0 flex)! oye gles) 


- 2 (11.22) 
= f(ex)"* 0 (a*y) 0 gles)". 


Consider the invertible map 
9:Q > Qa flex) hoa gleg) 
(compare Exercise 42). Written in terms of 0, the equation (11.22) becomes 
6(x) 0 O(y) = O(a xy). 
Thus 0: (Q,*, ex) + (Q,0, eo) is the required isomorphism. l] 


COROLLARY 11.40 (Isotopic groups.) 
If two groups are isotopic, then they are isomorphic. 


The final concern of this chapter is to resolve a critical issue that arose in 
Section 11.2: 


Can each Latin square be given suitable row and column labels so 
that it becomes the body of a group multiplication table? 
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(A positive answer would suggest that the study of quasigroups could be 
reduced to a study of groups.) By Theorem 11.34, the question is equivalent 
to asking whether each finite quasigroup is principally isotopic to a group. 
By Theorem 11.38 and the transitivity of the isotopy relation, the question 
reduces to asking whether each finite loop is principally isotopic to a group. 
Finally, by Theorem 11.39, the question becomes: are there any finite loops 
that are not associative? 

There is a unique loop with identity 0 on the set {0,1}, the group (Z/2, +). 
Now consider a loop of order 3, on the set {0,1,2}. With the natural ordering 
of the row and column labels, the body of the multiplication table becomes 
the incomplete Latin square 

012 
la 
2 


There are apparently two choices for the element a, namely 0 or 2. However, 
in the former case, there is no way to complete the Latin square, since the 
completion procedure stalls at 

012 

102 

2 


On the other hand, choosing a = 2 forces a unique completion to the Latin 
square 

012 

120 

201 


that gives the multiplication table of the group (Z/3,+). So loops of order 
3 are associative. In Exercise 43, you are asked to apply similar techniques 
to show that each loop of order 4 is associative. However, the loop whose 
multiplication table is displayed in Figure 11.10 is not associative, since any 
group of order 5 is commutative. 


wo nM FE OO 
Rw nm HF OO 
wn Oo PP 
FE Oo fk Ww bw 
wD oR FEF Oo w 
Cor wWNTW 


FIGURE 11.10: A nonassociative loop of order 5. 
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11.8 Exercises 


1. 


Show that the integers form a nonassociative quasigroup (Z,—) under 
subtraction. 


. The geometric mean of two positive real numbers xz and y is 


vey = fry. (11.23) 


Show that under the multiplication « of (11.23), the set of positive real 
numbers forms a nonassociative quasigroup. 


. Let Q be the set of negative real numbers. 


(a) Show that (11.23) is defined for x, y in Q. 

(b) Show that (11.23) does not give a quasigroup multiplication on Q. 

(a) Show that for a natural number r, the set P,. of nonzero bit strings 
of length r + 1 is closed under the multiplication * of (11.3). 


b) Show that for a positive integer r, the quasigroup P, is not a group. 
gS & 
(Hint: Consider the properties of the identity element of a group.) 


. Write out the multiplication tables for the quasigroups P; and P» of 


Example 11.4. 


. Find values for the unknowns a, b, c, d from the integers modulo 4 so 


that 
a boc  d 
0 0 3 2 1 
1 1 0 3 2 
2 2 1 0 8 
3 3 2 1 O 


becomes the addition table for the group (Z/4, +). 


. Let (Q,-) be a finite, nonempty quasigroup. Show that the body of the 


multiplication table of (Q,-) forms a Latin square. 


. Complete the proof of Theorem 11.5. 


. Without directly using tables of group addition or subtraction, construct 


a 5 x 5 Latin square. 
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FIGURE 11.11: An intercalate in a Latin square. 


10. 


11. 


12. 


13. 
14. 
15. 
16. 


In a Latin square, an intercalate is a configuration of four entries at the 
intersection of two rows and two columns, containing just two distinct 
elements a and b (Figure 11.11). 


(a) Show that interchanging the entries a and 6 of an intercalate within 
one Latin square creates a new Latin square. 


(b) Let t be an element of a finite group G, with t? = 1 4 t. Show 
that the multiplication table body of G contains an intercalate with 
entries 1 and t. 


(c) Use intercalates to create new quasigroups of order 6 from each 
group of order 6. Which of the new quasigroups are not associative? 


Let (A,+) be an additive group, considered as a quasigroup with + as 
the quasigroup multiplication. Write the corresponding left division in 
terms of addition, subtraction, and negation. 


Verify that (11.7) holds in each bit string quasigroup P,. (Hint: There 
are two cases to consider, « = y and x F y.) 


Verify that the equations (11.8) hold in a group. 
Prove Proposition 11.11(c),(d). 
Complete the proof of Proposition 11.12. 


Complete the proof of Theorem 11.13. 


17. 


18. 
19. 


20. 


2A 
22. 


23. 
24. 


25. 


26. 
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Suppose that (Q,-) is the quasigroup with multiplication table 


0 1 2 3 
0 0 3 1 2 
1 1 2 3 0 
2 2 1 0 3 
3 3 0 2 #1 


Determine the multiplication tables for the quasigroups (Q, \) and (Q, /). 
Verify Proposition 11.15. 


A quasigroup (Q,-) is said to be commutative if x-y = y-wx for all x, yin 
Q. Show that a quasigroup is commutative if and only if right division 
is the opposite of left division. 


Show that a bit string quasigroup P, (compare Example 11.4) coincides 
with each of its conjugates. 


How many distinct conjugates does the group (Z, +) of integers possess? 
Consider Figure 11.6. 
(a) How many distinct 3-element subquasigroups are displayed in the 
figure? 


(b) Show that knowledge of the 3-element subquasigroups, along with 
the observation that « * « = x for each element x, specifies the 
multiplication * in (P2, *) completely. 


Complete the proof of Proposition 11.19. 


Consider the closed unit interval J = [0,1], the set of real numbers from 
0 to 1. 


(a) Show that I is closed under the multiplication o of the arithmetic 
mean quasigroup (R,o) of Example 11.3. 


(b) Show that I does not form a subquasigroup of the arithmetic mean 
quasigroup (R,o). 


Let (G,-) be a group, and let S be a nonempty subset of G. Show that 
S forms a subgroup of (G,-) if and only if it forms a subquasigroup of 
the quasigroup (G,-). 


In the context of Proposition 11.22, show that the quasigroup homo- 
morphism f : (P,-) — (Q,-) preserves right divisions. 
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27. 


28. 


29. 
30. 


3l. 


32. 


33. 


34. 
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Show that the arithmetic mean quasigroup (R,o) of Example 11.3 is 
isomorphic to the geometric mean quasigroup of Exercise 2. 


Suppose that a quasigroup Q is isomorphic to a group G. Show that Q 
is associative. 


Complete the proof of Proposition 11.23. 


Let P and Q be quasigroups. Show that the projections 
pi: PX Q; (21,22) 21 
and 
p2: Px Q; (#1, 22) > 22 
are quasigroup homomorphisms. 
Let X be a set, and let (Q,-) be a quasigroup. Show that the set Q* of 


functions f : X — Q from X to Q carries a componentwise quasigroup 
structure (Q*,-), with 


(f+ 9)(x) = F(x): g(x) 
for f, g in Q* and «x in X. 
Consider Figure 11.7. 


(a) How many distinct 3-element subquasigroups are displayed in the 
figure? 

(b) Show that knowledge of the 3-element subquasigroups, along with 
the observation that « * « = x for each element x, specifies the 
multiplication * in (P; x P,,*) completely. 


Consider the product of the arithmetic mean quasigroup (R,o) with 
itself — compare Example 11.3. 


(a) Give a geometric interpretation of the product quasigroup structure 
(R?,0) on the Cartesian plane R?. 

(b) Give a geometric interpretation of right division in the product 
quasigroup structure (R?,o) on the Cartesian plane R?. (Hints: 
Compare Example 11.10. Recall the two types of reflection in the 
plane, point reflections and line reflections.) 


Consider the quasigroup (Z/3, —) of integers modulo 3, with subtraction 
as the quasigroup multiplication. Show that the product 


(Z/3,—) x (Z/2, +) 


of (Z/3,—) with the cyclic group (Z/2,+) is isomorphic to the quasi- 
group (Z/6,—) of integers modulo 6 under subtraction. 


35. 


36. 


37. 


38. 


39. 


40. 


Al. 


42. 


43. 


44. 
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Show that the quasigroup of integers under subtraction is isotopic to 
the group of integers under addition. 


Show that the arithmetic mean quasigroup (R,o) of Example 11.3 is 
isotopic to the additive group (R,+) of real numbers. 


Show that the conjugates of a group are isotopic. 


Let Q be aset. Show that principal isotopy forms an equivalence relation 
on the set of quasigroup structures (Q,-) on Q. 


Verify the details of Proposition 11.33. 


Use Proposition 11.36 to show that each finite, nonempty quasigroup is 
principally isotopic to a loop. 


In the proof of Theorem 11.38, show that the maps a and @ are bijective. 


Show that the map @: Q — Q used in the proof of Theorem 11.39 is 
invertible. 


Show that each 4 x 4 Latin square is the body of the multiplication table 
of a group. 


Use intercalates (compare Exercise 10) to show that there are other 
nonassociative loops on the set {0,1,2,3,4}, besides the one displayed 
in Figure 11.10. 


11.9 Study projects 


1. 


Quasigroups and Latin squares as experimental designs. 


(a) A housing association is conducting an experiment to determine 
the best kind of wall siding to use for its houses: concrete, metal, 
plastic, or wood. For the experiment, it has houses, numbered 
1, 2, 3, 4, at four different locations, with different climates and 
atmospheric conditions. Each house has walls facing in each of 
the cardinal directions: north, south, east, and west. How should 
the different kinds of siding be applied for the experiment, so that 
each kind of siding is tested on each house, and on each direction 
of wall? 


Set up a bordered 4 x 4 Latin square to plan how the experiment 
should be conducted. The house addresses from 1 to 4 should label 
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the rows. The four directions should label the columns. The body 
should be a Latin square on the 4-element set 


{concrete, metal, plastic, wood} 


of siding types. The table entry in the row labeled 7 and column 
labeled d should indicate which type of siding is to be applied to 
the wall facing direction d on house number 7. 


There are 7 students enrolled in a one-quarter algebra class. In 
each of 7 weeks of the course, the instructor wishes to designate 
a group of 3 students to prepare a special presentation. In order 
to assess everybody fairly, each student should be grouped exactly 
once with each other student. Use the bit-string quasigroup P2 of 
Figure 11.6 to prepare an assignment plan for the instructor. Note 
that each student is involved in 2 different group presentations. 
Repeat the exercise of (b) for the case of 9 students in 12 groups of 
3, during a one-semester course. Which quasigroup should be used 
in this case? In how many presentations is each student involved? 


If n students are to be assigned to t groups of 3, with each pair of 
students appearing in exactly one group as in (b) and (c) above, 


show that 
n—-1 = 


= 
2 


and 
n(n — 1) 


2 


Conclude that an assignment plan for n students is only possible 
if n is congruent to 1 or 3 modulo 6. 


= 3t. 


2. Orthogonal Latin squares. 


(a) 


Let Q@ be a nonempty set with a finite number n of elements. Two 
Latin squares L;(Q) and L2(Q) on the set Q are said to be (mutually) 
orthogonal, if for each ordered pair (x1, 22) of elements of Q, there are 
unique integers 1 < 7,7 < n such that for k = 1,2, the element zz 
appears in the i-th row and j-th column of L;,(Q). A pair of orthogonal 
Latin squares is displayed in Figure 11.12. 


Let (Q,*) and (Q,o) be quasigroup structures on the given set 
Q. Suppose that L,(Q) and L.(Q) are the respective bodies of 
the multiplication tables of (Q,*«) and (Q,0°), presented with a row 
and column labeling that is the same for each table. Show that the 
Latin squares L,(Q) and L,(Q) are orthogonal if and only if the 
function 


QxQ-QxQ; (zy) (xy, roy) 


is bijective. 
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FIGURE 11.12: A pair of orthogonal Latin squares. 


(b) What quasigroup structures on the 3-element set Z/3 of residues 
modulo 3 correspond to the orthogonal Latin squares displayed in 
Figure 11.12? 


(c) Let p be a prime number. Let J and m be distinct nonzero residues 
modulo p. Show that the quasigroups (Z/,, *) with 
rey=atly 
and (Z/,,0) with 
roy=xr+my 
yield mutually orthogonal Latin squares on Z/y. 


(d) Let F' be a finite field. Let 1 and m be distinct nonzero elements 
of F’. Show that the quasigroups (F,*) with 


rey=atly 


and (F,0) with 
roy=xr+my 


yield mutually orthogonal Latin squares on F’. 


3. Left and right multiplications. 
Let (Q,-) be a quasigroup. By analogy with (5.16), define a map 


Agi: Qe Qrreg-z (11.24) 


for each element q of Q. This map is known as left multiplication by the 
element q. Similarly, define the right multiplication 


Pq: Q7Q;tru2x-g (11.25) 
for each element q of Q. 


(a) Show that the respective identities (IL), (IR) of Proposition 11.12 
imply the injectivity of the left and right multiplications Az, pz. 


(b) Show that the respective identities (SL), (SR) of Proposition 11.12 
imply the surjectivity of the left and right multiplications Az, py. 
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(c) Conclude that in the quasigroup (Q,-), each left multiplication 
(11.24) and right multiplication (11.25) is a permutation of Q. 


(d) Consider the case where (Q, -) is the quasigroup (Z/,,, —) of integers 
modulo a positive integer n, under subtraction. Show that the set 


{Acs Px | « in Z/n} 


forms a group of permutations on Z/,,, isomorphic to the dihedral 
group D,, of Study Project 3 in Chapter 4. 


(e) Show that a quasigroup (Q,-) is associative if and only if the map 
AtQ + OQhai ag 


is a quasigroup homomorphism. 


11.10 Notes 
Section 11.3 


The symbols / and \ are often used within mathematical software in the 
same sense as in Definition 11.6. Thus if A and B are invertible (square) 
matrices, A/B may denote the matrix AB~', while A\B is used for A~'B — 
compare (11.8). The notation is extended to denote solutions to equations. 
For example, the solution x of the vector equation Ax = y is written as 
x= A\y. 

Conjugates of a quasigroup are sometimes described as “parastrophes.” 


